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Description 

This invention relates to a bone morphogenetic protein (BMP) which initiates cartilage and bone growth. 
Th8 full length coding sequence for BMP, a polypeptide, is provided. The BMP is provided by isolation 
s from bone sources and by synthesis using recombinant DNA techniques. 

BACKGROUND OF THE INVENTION 

it is known that demineralized bone matrix induces new bone formation when implanted in the soft 
to tissue by a process generally designated as matrix induced bone formation (see Urist, M.R., Science , 150 : 
.893-899 (1965)). There have been numerous efforts to extract and identify the active material (or materials) 
which induces this process, and it has been generally referred to in the literature as bone morphogenetic 
protein(s) (BMP). It is uncertain whether BMP is a single material or a mixture of materials, and there does 
not appear to be agreement among the investigators as to which material, if any, is the bone morphogenetic 
75 protein. As discussed herein, the term BMP is used to describe the protein having the amino acid sequence 
shown in FIG. 3 (bovine BMP) or FIG. 5 (human BMP), without the signal peptide. 

The therapeutic use of BMP offers considerable advantages over use of traditional bone graft materials. 
While not intended to be limited by any theory one hypothesis assumes that BMP initiates the differenti- 
ation of tissue cells into osteoblasts (cells that manufacture bone). During a process that replicates normal 
20 human fetal development, BMP-induced osteoblasts form cartilage which, over a period of several weeks 
yields solid bone. Thus BMP may be useful for replacing bone that has been destroyed by disease or 
accident, for use in treatment of scoliosis victims, for treatment of mal- or mis-formed bone, for use in 
healing of a fracture, dental reconstruction, hip replacement, bone remodeling, and control of osteoporosis. 
It is thus an object of the present invention to produce a functional cartilage and bone growth factor or a 
25 component thereof, which is a protein identified by its entire amino acid sequence, which has BMP activity. 
It is another object of the present invention to produce this biologically active BMP protein by 
recombinant DNA technology. 

SUMMARY OF THE INVENTION 

30 

The present invention provides a class of mature native mammalian proteins termed herein as "bone 
morphogenetic protein" or "BMP", exemplified by the native human and bovine sequences described 
herein. Generally, this class of proteins induce bone growth in vivo or in vitro . The human and/or bovine 
sequences are representative of the class, and can be used to identify and isolate other mammalian BMP 

35 proteins, which will be at least partially or substantially homologous in nucleotide and amino acid 
sequences. Those of skill in the art will be able to readily identify other mammalian BMP's based on 
sequence homology to the human and bovine proteins disclosed herein, as well as biological activity. It is 
recognized that there may be allelic variations in BMP within a species, and such allelic variants are also 
within the scope of the class of proteins provided by the present invention. 

40 The present invention further provides polypeptides which are analogs of BMP, such as BMP muteins, 
fusion proteins comprising BMP or BMP domains, and BMP fragments. Preferred analogs have BMP 
activity. A BMP mutein is a protein substantially homologous to a native BMP sequence (e.g., a minimum of 
about 75%. 85%, 90% or 95% homologous) wherein at least one amino acid is different. The term fusion 
protein includes a protein comprising a complete BMP sequence or a domain, and a heterologous N- or C- 

45 terminal sequence (such as a signal sequence or sequence which protects the protein from degradation). A 
BMP fragment or domain is an amino acid sequence of sufficient length from a BMP protein such that it is 
identifiable as having been derived from such BMP protein. The origin of a particular peptide can be, 
determined, for example by comparing its sequence to those in public databases. 

The present invention provides in another embodiment BMP having amino acid sequence shown in 

50 FIGS. 3 and 5 (which also show the signal peptide). The present invention also provides methods of 
preparing the BMP by recombinant DNA techniques. 

In yet another embodiment, the present invention provides the DNA sequence encoding BMP or 
analogs thereof, which may be us d to construct vectors for xpr ssion in host systems by recombinant 
DNA techniques. 

55 In still another embodiment, the present invention provides therapeutic compositions comprising BMP 
and, optionally, other osteoinductive associated factors such as matrix Gla protein (MGP) and bon 
calcification factor (BCF) and methods for forming cartilage and bone in vertebrates by introducing in vivo at 
th desired sit an ffectiv bon initiating amount of BMP. Th identity of BMP was first r ported by Price, 
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Urist and Otawara in Biochem. Biophys. R s. Comm . 117:765-771 (1983). Th identity of BCF is disclosed 
in commonly assigned copending Serial No. 360,826, fil d June 2, 1989. 

BRIEF DESCRIPTION OF THE DRAWINGS 

5 

FIG. 1 shows the amino acid sequences of eight peptides from tryptic digestion of bovine BMP; 
FIG. 2 shows DNA and amino acid sequences of two exon-containing regions of the bovine BMP gene; 
FIG. 3 shows the nucleotide sequence of bovine BMP cDNA and the deduced amino acid sequence of 
the precursor polypeptide; 

70 FIGS. 4a through 4c show the sequences of the exon-containing regions of the human BMP gene. 

FIG. 5 shows the nucleotide sequence of human BMP and the deduced amino acid sequence of the 
precursor polypeptide. 

DETAILED DESCRIPTION OF THE INVENTION 

75 

The BMP according to the present invention may be obtained, free of other osteoinductive associated 
factors, directly from bone sources, by preparative peptide synthesis using chemical methods (such as the 
Merrifield synthesis method) or by recombinant DNA technology. 

As more particularly described in Example 1, BMP may be obtained from partially purified human, 
20 bovine, or other vertebrate bone extracts, by preparative gel electrophoresis and electroelution of the 
protein. 

BMP nucleic acid sequences may be obtained by recombinant DNA methods, such as by screening 
reverse transcripts of mRNA, or by screening genomic libraries from any cell. The DNA may also be 
obtained by synthesizing the DNA using commonly available techniques and DNA synthesizing apparatus. 

25 Synthesis may be advantageous because unique restriction sites may be introduced at the time of 
preparing the DNA, thereby facilitating the use of the gene in vectors containing restriction sites not 
otherwise present in the native source. Furthermore, any desired site modification in the DNA may be 
introduced by synthesis, without the need to further modify the DNA by mutagenesis. 

In general, DNA encoding the BMP may be obtained from human, bovine or other sources by 

30 . constructing a cDNA library from mRNA isolated from vertebrate tissue; and screening with labeled DNA 
probes encoding portions of the human or bovine chains in order to detect clones in the cDNA library that 
contain homologous sequences; or by polymerase chain reaction (PCR) amplification of the cDNA (from 
mRNA) and subcloning and screening with labeled DNA probes; and then analyzing the clones by 
restriction enzyme analysis and nucleic acid sequencing so as to identify full-length clones and, if full-length 

35 clones are not present in the library, recovering appropriate fragments from the various clones and ligating 
them at restriction sites common to the clones to assemble a clone encoding a full-length molecule. 
Particularly preferred DNA probes are set forth in the accompanying examples. Any sequences missing 
from the 5' end of the BMP cDNA may be obtained by the 3* extension of the synthetic oligonucleotides 
complementary to BMP sequences using mRNA as a template, (so-called primer extension), or homologous 

40 sequences may be supplied from known cDNAs derived from human or bovine sequences as shown in FIG. 
3 or FIG. 5. 

The practice of the present invention will employ, unless otherwise indicated, conventional molecular 
biology, microbiology, and recombinant DNA techniques within the skill of the art. Such techniques are 
explained fully in the literature. See e.g., Maniatis, Fritsch & Sambrook, "Molecular Cloning: A Laboratory 

45 Manual" (1982); "DNA Cloning: A Practical Approach," Volumes I and II (D.N. Clover ed. 1985); 
"Oligonucleotide Synthesis" (M.J. Gait ed. 1984); "Nucleic Acid Hybridization" (B.D. Hames & S.J. Higgins 
eds. 1985); "Transcription And Translation" (B.D. Hames & S.J. Higgins eds. 1984); "Animal Cell Culture" 
(R.I. Freshney ed. 1986); "Immobilized Cells And Enzymes" (IRL Press, 1986); B. Perbal, "A Practical 
Guide To Molecular Cloning" (1984). 

50 In describing the present invention, the following terminology will be used in accordance with the 
definitions set out below. 

The term "osteoinductive associated factors" includes factors known in the art which are present in 
mammalian bone or other mammalian tissue and t nd to co-purify with BMP or BMP activity. Such factors . 
include, but are not limited to, proteins which have been isolated from bon having reported relative 
55 molecular weights by migration on a SDS-PAGE of 34KD, 24KD, 18.5KD, 17.5KD, 16.5KD, 14KD (as cited in 
the U.S. Pat nt No. 4,761.471), and 6KD (reported by Price, PA, et al., from Prot. Natl. Acad. Sci. , 73, pp. 
1447-1451. 3976). All observed molecular weights ar report d her in as relativ mol cular w ight by 
migration on SDS-PAGE gel. 
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A "replicon" is any g netic el ment (e.g., plasm id, chromosome, virus) that functions as an autonomous 
unit of DNA replication in vivo ; i.e., capable of r plication under its own control. 

A "vector" is a replicon, such as plasmid, phage or cosmid, to which another DNA segment may be 
attached so as to bring about the replication of the attached segment. 

5 A "DNA molecule" refers to the polymeric form of deoxy ribonucleotides (adenine, guanine, thymine, or 
cytosine) in its either single stranded form, or a double-stranded helix. This term refers only to the primary 
and secondary structure of the molecule, and does not limit it to any particular tertiary forms. Thus, this 
term includes double-stranded DNA found, inter alia , in linear DNA molecules (e.g., restriction fragments), 
viruses, plasmids, and chromosomes. In discussing the structure of particular double-stranded DNA 

to molecules, sequences may be described herein according to the normal convention of giving only the 
sequence in the 5' to 3* direction along the nontranscribed strand of DNA (i.e., the strand having a sequence 
homologous to the mRNA). 

A DNA "coding sequence" is a double-stranded DNA sequence which is transcribed and translated into 
a polypeptide in vivo when placed under the control of appropriate regulatory sequences. The boundaries of 

75 the coding sequence are determined by a start codon at the 5' (amino) terminus and a translation stop 
codon at the 3* (carboxy) terminus. A coding sequence can include, but is not limited to, procaryotic 
sequences, cDNA from eucaryotic mRNA, genomic DNA sequences from eucaryotic (e.g., mammalian) 
DNA, and even synthetic DNA sequences. A polyadenylation signal and transcription termination sequence 
will usually be located 3* to the coding sequence. 

20 Transcriptional and translation^ control sequences are DNA regulatory sequences, such as promoters, 
enhancers, polyadenylation signals, terminators, and the like, that provide for the expression of a coding 
sequence in a host cell. 

A "promoter sequence" is a DNA regulatory region capable of binding RNA polymerase in a cell and 
initiating transcription of a downstream (3* direction) coding sequence. For purposes of defining the present 

25 invention, the promoter sequence is bounded at its 3' terminus by the transcription initiation site and 
extends upstream (5' direction) to include the minimum number of bases or elements necessary to initiate 
transcription at levels detectable above background. Within the promoter sequence will be found a 
transcription initiation site (conveniently defined by mapping with nuclease S1), as well as protein binding 
domains (consensus sequences) responsible for the binding of RNA polymerase. Eucaryotic promoters will 

30 often, but not always, contajn "TATA" boxes and "CAT" boxes. Procaryotic promoters contain Shine- 
Dalgarno. sequences in addition to the -10 and -35 consensus sequences. 

A coding sequence is "under the control" of transcriptional and translational control sequences in a ceil 
when RNA polymerase transcribes the coding sequence into mRNA, which is then translated into the 
protein encoded by the coding sequence. 

35 A "signal sequence" can be included before the coding sequence. This sequence encodes a signal 
peptide, N-terminal to the polypeptide, that communicates to the host cell to direct the polypeptide to the 
cell surface or secrete the polypeptide into the media, and this signal peptide is clipped off by the host cell 
before the protein leaves the cell. Signal sequences can be found associated with a variety of proteins 
native to prokaryotes and eukaryotes. For instance, alpha-factor, a native yeast protein, is secreted from 

40 yeast, and its signal sequence can be attached to heterologous proteins to be secreted into the media (See 
U.S. Patent 4,546,082, EPO 0 116 201, publication date 12 January 1983; U.S. Patent Application Ser. No. 
522,909, filed 12 August 1983). Further, the alpha-factor and its analogs have been found to secrete 
heterologous proteins from a variety of yeast, such as Saccharomyces and Kluyveromyces, (EPO 
88312306.9 filed 23 December 1988; U.S. Patent Application Ser. No. 339,682, filed 30 December 1987, 

45 and EPO Pub. No. 0 301 669, publication date 1 February 1989). 

A cell has been "transformed" by exogenous or heterologous DNA when such DNA has been 
introduced inside the cell. The transforming DNA may or may not be integrated (covalently linked) into 
chromosomal DNA making up the genome of the cell. In prokaryotes, yeast, and mammalian cells for 
example, the transforming DNA may be maintained on an episomal element such as a plasmid. With 

so respect to eukaryotic cells, a stably transformed cell is one in which the transforming DNA has become 
integrated into a chromosome so that it is inherited by daughter cells through chromosome replication. This 
stability is demonstrated by the ability of the eukaryotic cell to establish cell lines or clones comprised of a 
population of daughter c lis containing the transforming DNA. A "clone" is a population of cells d rived 
from a single cell or common anc stor by mitosis. A "cell line" is a clon of a primary cell that is capabt 

55 of stable growth in vitro for many generations. 

Two DNA sequences ar "substantially homologous" when at least about 85% (preferably at least 
about 90%, and most pr ferably at least about 95%) of the nucleotides match ov r th defin d length of th 
DNA sequenc s. Sequences that are substantially homologous can be identified in a Southern hybridization 
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experiment under, for xample, stringent conditions as defined for that particular system. Defining appro- 
priat hybridization conditions is within th skill of the art. Se , _.g., Maniatis _t al., supra ; DNA Cloning, 
Vols. I & II, supra ; Nucleic Acid Hybridization, supra . 

A "heterologous" region of the DNA construct is an identifiable segment of DNA within a larger DNA 

5 molecule that is not found in association with the larger molecule in nature. Thus, when the heterologous 
region encodes a mammalian gene, the gene will usually be flanked by DNA that does not flank the 
mammalian genomic DNA in the genome of the source organism. Another example of a heterologous 
coding sequence is a construct where the coding sequence itself is not found in nature (e.g., a cDNA where 
the genomic coding sequence contains introns, or synthetic sequences having codons different than the 

70 native gene). Allelic variations or naturally-occurring mutational events do not give rise to a heterologous 
region of DNA as defined herein. 

A composition comprising "A" (where "A" is a single protein, DNA molecule, vector, etc.) is substan- 
tially free of "B" (where "B" comprises one or more contaminating proteins, DNA molecules, vectors, etc.) 
when at least about 75% by weight of the proteins, DNA, vectors (depending on the category of species to 

75 which A and B belong) in the composition is "A". Preferably, "A" comprises at least about 90% by weight 
of the A + B species in the composition, most preferably at least about 99% by weight. It is also preferred 
that a composition, which is substantially free of contamination, contain only a single molecular weight 
species having the activity or characteristic of the species of interest. 

An "antibody" is any immunoglobulin, including antibodies and fragments thereof, that binds a specific 

20 epitope. The term encompasses, inter alia, polyclonal, monoclonal, and chimeric antibodies. 
For more about chimeric antibodies see U.S. Patents Nos., 4,816,397 and 4,816,567. 
The intron-free DNA provided by the present invention is novel, since it is believed that the naturally- 
occurring human and bovine genes contain introns. Hence, the term "intron-free" excludes the DNA 
sequences which naturally occur in the chromosomes of human or bovine cells. The present invention also 

25 encompasses the intron-free cDNA sequences derivable from the DNA sequences disclosed herein. 

As more particularly described in the following examples, human and bovine cDNA libraries were 
initially probed for sequences encoding BMP sequences using labeled oligodeoxynucleotides whose 
sequences were based on a partial amino acid sequence determined from analysis of purified, protein 
samples derived from bone described herein. However, it is realized that once being provided with intron- 

30 free DNA encoding human and bovine BMP and their leader sequences as described herein, one of 
ordinary skill in the art would recognize that other precisely hybridizing probes may be prepared from the 
described sequences in order to readily obtain the remainder of the desired human or bovine gene. 

Vectors are used to simplify manipulation of the DNA which encodes the BMP polypeptide, either for 
preparation of large quantities of DNA for further processing (cloning vectors) or for expression of the BMP 

35 polypeptide (expression vectors). Vectors comprise plasmids, viruses (including phage), and integratable 
DNA fragments, i.e., fragments that are integratable into the host genome by recombination. Cloning vectors 
need not contain expression control sequences. However, control sequences in an expression vector 
include transcriptional and translational control sequences such as a transcriptional promoter, an optional 
operator sequence to control transcription, a sequence encoding suitable ribosome binding sites (for 

40 prokaryotic expression), and sequences which control termination of transcript ton and translation. The 
expression vector should preferably include a selection gene to facilitate the stable expression of BMP 
and/or to identify transformants. However, the selection gene for maintaining expression can be supplied by 
a separate vector in cotransformation systems using eukaryotic host cells. 

Suitable vectors generally will contain replicon (origins of replication, for use in non-integrative vectors) 

45 and control sequences which are derived from species compatible with the intended expression host. By 
the term "replicable" vector as used herein, it is intended to encompass vectors containing such replicons 
as well as vectors which are replicated by integration into the host genome. Transformed host cells are cells 
which have been transformed or transfected with vectors containing BMP encoding DNA. The expressed 
BMP will be deposited intracellular^ or secreted into either the periplasmic space or the culture super- 

50 natant, depending upon the host cell selected and the presence of suitable processing signals in the 
expressed peptide, e.g. homologous or heterologous signal sequences. 

. Suitable host cells are prokaryotes or eukaryotic cells. Prokaryotes include Gram negative or Gram 
positive organisms, for exampl E. coli or bacilli. Eukaryotic celts includ yeast or higher ukaryotic cells 
such as stablished c II lin s of mammalian origin. 

55 Expression vectors for host cells ordinarily include an origin of replication, a promoter located upstream 
from the BMP coding sequence, together with a ribosom binding site, a polyadenylation site, and a 
transcriptional t rmination sequ nc . Those of ordinary skill will appreciat that certain of these sequ nces 
ar not requir d for xpression in certain hosts. An xpression v ctor for us with microbes need only 
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contain an origin of replication recognized by the host, a promoter which will function in the host and a 
selection gene. 

An xpression vector is construct d according to the pr s nt invention s that th BMP coding 
sequence is located in the vector with the appropriate regulatory sequences, the positioning and orientation 

s of the coding sequence with respect to the control sequences being such that the coding sequence is 
transcribed and translated under the "control" of the control sequences (i.e., RNA polymerase which binds 
to the DNA molecule at the control sequences transcribes the coding sequence). The control sequences 
may be ligated to the coding sequence prior to insertion into a vector, such as the cloning vectors 
described above. Alternatively, the coding sequence can be cloned directly into an expression vector which 

ro already contains the control sequences and an appropriate restriction site. For expression of BMP in 
prokaryotes and yeast, the control sequences will necessarily be heterologous to the coding sequence. If 
the host cell is a prokaryote, it is also necessary that the coding sequence be free of introns (e.g., cDNA). If 
the selected host cell is a mammalian cell, the control sequences can be heterologous or homologous to 
the BMP coding sequence, and the coding sequence can either be genomic DNA containing introns or 

15 cDNA. Ether genomic or cDNA coding sequences can be expressed in yeast. 

Expression vectors must contain a promoter which is recognized by the host organism. Promoters 
commonly known and available which are used in recombinant DNA construction include the ^-lactamase 
(penicillinase) and lactose promoter systems, a tryptophan (tip) promoter system and the tac promoter. 
While these are commonly used, other known microbial promoters are suitable. 

20 In addition to prokaryotes, eukaryotic cells such as yeast are transformed with BMP encoding vectors. 
Saccharomyces cerevisiae , or common baker's yeast, is the most commonly used among lower eukaryotic 
host microorganisms. However, a number of other species are commonly available and useful herein. Yeast 
vectors generally will contain an origin of replication from the 2 micron yeast plasm id or an autonomously 
replicating sequence (ARS), a promoter, DNA encoding BMP, sequences for polyadenylation and transcript 

25 ion termination, and a selection gene. 

Suitable promoter sequences in yeast vectors include the promoters for the glycolytic enzymes such as 
enolase, 3-phosphoglycerate kinase, glyceraJdehyde-3-phosphate dehydrogenase, hexokinase, pyruvate 
decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate 
kinase, triosephosphate isomerase, phosphoglucose isomerase, and glucokinase. 

30 Other yeast promoters, which have the additional advantage of transcription controlled by growth 
conditions are the promoter regions for alcohol dehydrogenase 1 or 2, isocytochrome C, acid phosphatase, 
as well as enzymes responsible for maltose and galactose utilization. 

Higher eukaryotic cell cultures may be used, whether from vertebrate or invertebrate cells, including 
insects, and the procedures of propagation thereof are known. See, for example, Tissue Culture , Academic 

35 Press, Kruse and Patterson, editors (1973). 

Suitable host cells for expressing BMP in higher eukaryotes include: monkey kidney CVI line trans- 
formed by SV40 (COS-7, ATCC CRL 1651); baby hamster kidney cells (BHK, ATCC CRL 10); Chinese 
hamster ovary-cells-DHFR (described by Urlaub and Chasin, PNAS (USA) 77: 4216 (1980)); mouse Sertoli 
cells (TM4, Mather, J.P., Biol. Reprod. 23: 243-251 (1980)); monkey kidney ceils (CVI ATCC CCL 70); 

40 African green monkey kidney cells (VERO-76. ATCC CRL-1587); human cervical carcinoma cells (HELA, 
ATCC CCL 2); canine kidney cells (MDCK, ATCC CCL 34); buffalo rat liver cells (BRL 3A, ATCC CRL 
1442); human lung cells (W138, ATCC CCL 75); human liver cells (Hep 62, HB 8065); mouse mammary 
tumor (MMT 060652, ATCC CCL 51); rat hepatoma cells (HTC, M1 , 54, Baumann, M., et ah, J. Cell Biol. 85: 
1-8 (1980)) and TRI cells (Mather, J.P., et al., Annals N.Y. Acad. Sci. 383 : 44-68 (1982)). Commonly used 

45 promoters are derived from polyoma, adenovirus 2, and simian virus 40 (SV40). It will be appreciated that 
when expressed in mammalian tissue, the recombinant BMP may have higher molecular weight due to 
glycosylation. It is therefore intended that partially or completely glycosylated forms of BMP having 
molecular weights greater than 19KD are within the scope of this invention as well as its unglycosylated 
forms. 

so A number of procaryotic expression vectors are known in the art. See. e.g., U.S. Patent Nos. 4,440,859; 
4,436,815; 4,431,740; 4,431,739; 4,428,941; 4,425.437; 4,418,149; 4,411,994; 4,366,246; 4,342,832; see also 
U.K. Pub. Nos. GB 2,121,054; GB. 2,008,123; CB 2,007,675; and European Pub. No. 103,395. Preferred 
procaryotic expression systems are in E. coli . 

Other pr ferred expr ssion vectors ar thos for use in eucaryotic systems. An exemplary eucaryotic 

55 expression syst m is that employing vaccinia virus, which is well-known in the art. See. e.g., Macket et al. 
(1984) J. Virol. 49:857; "DNA Cloning," Vol. II, pp. 191-211, supra ; PCT Pub. No. WO 86/07593. Yeast 
expression vectors ar known in th art. S e, _.g.. U.S. Patent Nos. 4,446.235; 4,443,539; 4,430,428; se_ 
also European Pub. Nos. 103.409; 100,561; 96,491. Another preferred expr ssion system is vector pHS1, 
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which transforms Chines© hamst r ovary ceils. See PCT Pub. No. WO 87/02062. Mammalian tissue may be 
cotransformed with DNA encoding a selectabl marker such as dihydrofolate reductase (DHFR) or 
thymidine kinase and DNA encoding BMP. 

If wild type DHFR gene is employed, it is preferable to select a host cell which is deficient in DHFR, 

5 thus permitting the use of the DHFR coding sequence as marker for successful transfection in hgt" 
medium, which lacks hypoxanthine, glycine, and thymidine. An appropriate host cell in this case is the 
Chinese hamster ovary (CHO) cell line deficient in DHFR activity, prepared and propagated as described by 
Urlaub and Chasin, 1980, Proc. Nat. Acad. ScL (USA) 77: 4216. Expression vectors derived from 
baculovirus for use in insect cells are known and available in the art. See Lucklow and Summers, 

10 Biotechnology , 6, p. 47-55. 

Depending on the expression system and host selected, BMP is produced by growing host cells 
transformed by an exogenous or heterologous DNA construct, such as an expression vector described 
above under conditions whereby the BMP protein is expressed. The BMP is then isolated from the host 
cells and purified. If the expression system secretes BMP into growth media, the protein can be purified 

75 directly from cell-free media. If the BMP protein is not secreted, it is isolated from cell lysates. The 
selection of the appropriate growth conditions and recovery methods are within the skill of the art. 

The recombinants made BMP is recovered from transformed cells in accordance with known proce- 
dures. Preferably, an expression vector will be used which provides for secretion of BMP from the 
transformed cells; thus the cells may be separated by centrifugation. The BMP typically is purified by 

20 general protein purification techniques, including, but not limited to, size exclusion, ion-exchange 
chromatography, HPLC, and the like. 

Once a coding sequence for BMP has been prepared or isolated, it can be cloned into any suitable 
vector and thereby maintained in a composition of cells which is substantially free of cells that do not 
contain a BMP coding sequence (e.g., free of other library clones). Numerous cloning vectors are known to 

25 those of skill in the art. Examples of recombinant DNA vectors for cloning and host cells which they can 
transform include the various bacteriophage lambda vectors (E. coli ), pBR322 (E. coli ), pACYC177 (E. coli ), 
pKT230 (gram-negative bacteria), pGV1106 (gram-negative bacteria), pLAFRI (gram-negative bacteria), 
pME290 (non-E. coli gram-negative bacteria), pHV14 (E. coli and Bacillus subtilis), pBD9 (Bacillus), PIJ61 
(Streptomyces), pUC6 (Streptomyces), actinophage, $C31 (Streptomyces), Ylp5 (Saccharomyces), YCp19 

30 (Saccharomyces), and bovine papilloma virus (mammalian cells). See generally , DNA Cloning: Vols. I & II, 
supra ; T. Maniatis et al., supra ; B. Perbal, supra . 

Alternatively the BMP may be made by conventional peptide synthesis, for instance, by using the 
principles of the Merrifield synthesis and using commercial automatic apparatus designed to employ the 
methods of the Merrifield synthesis. Peptides prepared using conventional peptide synthesis may be 

35 purified using conventional affinity chromatography, gel filtration and/or RP-HPLC. 

Figure 3 shows the nucleotide sequence of bovine BMP cDNA and the deduced amino acid sequence 
or the precursor polypeptide. The putative signal peptidase cleavage site is noted (I). The cDNA sequence 
was obtained from a 830 bp Bgl II insert of bovine BMP cDNA clone #1, which was isolated from a calf liver 
cDNA library, as described below. 

40 Figure 5 shows the nucleotide sequence of human BMP cDNA and the deduced amino acid sequence 
of the precursor polypeptide. The putative signal peptidase cleavage site is noted (I). The cDNA sequence 
was obtained from a 600bp BamHI/Hindlll insert of human BMP cDNA clone A6, which was isolated from a 
BMP/PCR human kidney cDNA library, as described below. 

It is further intended from the nucleotides sequences in Figures 3 and 5 that BMP analogs are within 

45 the scope of the present invention. Analogs, such as fragments, may be produced, for example, by pepsin 
digestion of BMP. Other analogs, such as muteins, can be produced by standard site-directed mutagenesis 
of BMP coding sequences. Analcgs exhibiting "BMP activity" may be identified by the in vivo and/or in vitro 
assays, preferably in vitro the cartilage inducing assay, Methods of Enzymology , 146, pp 294-312, (1987). 
An example of a BMP analog are the yeast cleavage products produced in vivo from the full-length 

50 bovine or human BMP expression product. The processing of the bovine expression product results in two 
roughly 16KD polypeptides, described in Example 9 below, and one or both of these analogs are referred to 
herein as the "16KD yeast cleavage analog." 

As mention d above, a DNA sequenc encoding BMP can be prepared synthetically rather than cloned. 
The DNA sequence can be designed with the appropriate codons for the BMP amino acid sequence. In 

55 general, one will select preferred codons for the intended host if the s quence will be used for expression. 
The complet sequence is ass mbled from ov rlapping oligonucleotides prepared by standard methods 
and assembled into a complete coding s quence. See, _.g., Edge (1981) Natur 292 :756; Nambair, et al. 
(1 984) Scienc 223 :1299; Jay J al. (1984) J. Biol. Chem. 259:6311. 
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Synthetic DNA sequences allow convenient construction of g nes which will xpress BMP analogs or 
"mut ins". Alternatively, DNA encoding muteins can be made by site-directed mutagenesis of nativ BMP 
genes or cDNAs, and muteins can be made directly using conv ntional polypeptide synthesis. 

Site-directed mutagenesis is conducted using a primer synthetic oligonucleotide complementary to a 

5 single stranded phage DNA to be mutagenized except for limited mismatching, representing the desired 
mutation. Briefly, the synthetic oligonucleotide is used as a primer to direct synthesis of a strand 
complementary to the phage, and the. resulting double-stranded DNA is transformed into a phage- 
supporting host bacterium. Cultures of the transformed bacteria are plated in top agar, permitting plaque 
formation from single cells which harbor the phage. 

to Theoretically, 50% of the new plaques will contain the phage having, as a single strand, the mutated 
form; 50% will have the original sequence. The resulting plaques are hybridized with kinased synthetic 
primer at a temperature which permits hybridization of an exact match, but at which the mismatches with 
the original strand are sufficient to prevent hybridization. Plaques which hybridize with the probe are then 
picked, cultured, and the DNA recovered. 

75 A general method for site-specific incorporation of unnatural amino acids into proteins is described in 
Christopher J. Noren, Spencer J. Anthony-Cahill, Michael C. Griffith, Peter C. Schultz, (April 1989), Science, 
Vol 244, pp 182-188. This method may be used to create analogs with unnatural amino acids. 

Preparations of BMP and its analogs may be assayed in vivo according to the method described by 
Urist, et al., Methods in Enzymology (D. Barnes and D. A. Sirbaska, eds.), vol. 146, pp. 294-312, Academic 

20 Press, N.Y. (1987), and in vitro by the method of Sato and Urist, Clin. Orthop ., 183:180-187 (1984) as 
modified by Kawamura and Urist, Dev. Biol ., 130:435-442 (1988), all of which are incorporated by reference 
herein. 

Substantially pure BMP, higher molecular glycosylated forms thereof, or active fragments thereof, or the 
nontoxic salts thereof, combined with a pharmaceutical^ acceptable carrier to form a pharmaceutical 

25 composition, may be administered to mammals, including humans, either intravenously, subcutaneously, 
percutaneously, intramuscularly or orally. 

Such proteins are often administered in the form of pharmaceutical^ acceptable nontoxic salts, such as 
acid addition salts or metal complexes, e.g., with zinc, iron or the like (which are considered as salts for 
purposes of this application). Illustrative of such acid addition salts are hydrochloride, hydrobromide, 

30 sulphate, phosphate, maleate, acetate, citrate, benzoate, succinate, malate, ascorbate, tartrate and the like. 
If the active ingredient is to be administered in tablet form, the tablet may contain a binder, such as 
tragacanth, corn starch or gelatin; a disintegrating agent, such as alginic acid; and a lubricant, such as 
magnesium stearate. If administration in liquid form is desired, sweetening and/or flavoring may be used, 
and intravenous administration in isotonic saline, phosphate buffer solutions or the like may be affected. 

35 Pharmaceutical compositions viii usually contain an effective amount of BMP in conjunction with a 
conventional, pharmaceutically acceptable carrier. The dosage will vary depending upon the specific 
purpose for which the protein is being administered, and dosage levels in the range of about 0.1 tig to 
about 100 milligrams per Kg. of body weight may be used. 

The present invention further contemplates the combination of BCF, particularly human or bovine, with 

40 BMP as a therapeutic composition for initiating bone formation both in vitro and in vivo. 

Implants of recombinant BMP, alone or when mixed with one or more osteoinductive associated factors 
may be used to initiate cartilage and bone growth. The implants can be time-release composition 
encapsulated, for instance, in liposomes or other time-release membranes, natural or synthetic, which are 
absorbable by the host subject. The BMP may be either human, bovine or any other mammalian form, or 

45 mixtures thereof. To initiate bone growth, BMP may be mixed with any combination of one or more other 
proteins, particularly, with one or more other proteins derived from bone. Such mixtures may initiate 
cartilage formation, followed by bone growth. Implants of BMP may be used to induce cartilage and bone 
growth in the quadriceps compartment. 

The purification protocols, described in detail below, allow for the first time the purification of native 

so BMP in sufficient quantity and at a high enough purity to permit accurate amino acid sequencing. The 
amino acid sequences derived from the purified BMP allow for the design of probes to aid in the isolation of 
native BMP nucleic acid sequence, or the design of synthetic nucleic acid sequences encoding the amino 
acid sequence of BMP, as well as allowing both diagnostic and therapeutic antibodi s to BMP and its 
analogs to be produced for the first time. BMP DNA genes or fragments may also be utilized in a diagnostic 

55 test for identifying subjects having defective BMP-genes. 

Specific anti-s ra or monoclonal antibodies (d scribed below) can b mad to a synthetic BMP peptid 
having the sequ nc or fragments of th sequ nee of amino acid residues, such as those shown in Figures 
3 or 5. Examples ar th tryptic fragments shown in FIG. 1, and antibodi s th r to can be used to 
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immunoprecipitate any BMP present in a selected tissue, cell extract, or body fluid. Purified BMP from this 
s urce can then be sequenced and used as a basis for designing specific probes as described abov . 
Antibodies to other regions that diverg from known BMP can also be used. Also useful as antigens are 
purified native or recombinant BMP. 

5 Native, recombinant or synthetic BMP peptides (full length or subunits) can be used to produce both 
polyclonal and monoclonal antibodies. If polyclonal antibodies are desired, purified BMP peptide is used to 
immunize a selected mammal (e.g., mouse, rabbit, goat, horse, etc.) and serum from the immunized animal 
later collected and treated according to known procedures. Compositions containing polyclonal antibodies to 
a variety of antigens in addition to BMP can be made substantially free of antibodies which are not anti- 

10 BMP by immunoaffinity chromatography. 

Monoclonal anti-BMP antibodies can also be readily produced by one skilled in the art from the 
disclosure herein. The genera) methodology for making monoclonal antibodies by hybridomas is well 
known. Immortal, antibody-producing cell lines can also be created by techniques other than fusion, such as 
direct transformation of B lymphocytes with oncogenic DNA, or transfection with Epstein-Barr virus. See, 

75 e.g., M. Schreier et al., "Hybridoma Techniques" (1980); Hammerling et al., "Monoclonal Antibodies And T- 
cell Hybridomas" (1981); Kennett et aL, "Monoclonal Antibodies" (1980); see also U.S. Patent Nos. 
4,341,761; 4,399,1 21 ; 4,427,783; 4,444,887; 4,451,570; 4,466,917; 4,472,500; 4,491,632; 4,493,890. 

Panels of monoclonal antibodies produced against BMP peptides can be screened for various 
properties; i.e., isotype, epitope, affinity, etc. Of particular interest are monoclonal antibodies that neutralize 

20 the activity of BMP. Such monoclones can be readily identified in BMP activity assays. High affinity 
antibodies are also useful in immunoaffinity purification of native or recombinant BMP. 

Antibodies to BMP (both polyclonal and monoclonal) forms described herein may be used to inhibit or 
to reverse various clinical indications of bone disease such as osteoporosis, osteoarthritis, etc. One 
therapeutic approach would be to treat the patient with an effective dose of anti-BMP antibodies through a 

25 conventional intravenous route. BMP antagonists or agonists, such as BMP muteins, could also be used in 
place of antibodies. These anti-BMP compositions may also be useful in dectecting or inhibiting various 
forms of tumors, since some tumors are known to be induced by growth factors. 

The determination of the appropriate treatment regimen (i.e., dosage, frequency of administration, 
systemic vs. local, etc.) is within the skill of the art. For administration, the antibodies will be formulated in a 

30 unit dosage injectable form (solution, suspension, emulsion, etc.) in association with a pharmaceutical ly 
acceptable parenteral vehicle. Such vehicles are usually nontoxic and nontherapeutic. Examples of such 
vehicles are water, saline, Ringer's solution, dextrose solution, and Hank's solution. Nonaqueous vehicles 
such as fixed oils and ethyl oleate may also be used. A preferred vehicle is 5% (w/w) human albumin in 
saline. The vehicle may contain minor amounts of additives, such as substances that enhance isotonicity 

35 and chemical stability, e.g., buffers and preservatives. The antibody is typically formulated in such vehicles 
at concentrations of about 1 ug/ml to 10 mg/ml. 

Anti-BMP antibodies will also be useful in diagnostic applications. The present invention contemplates a 
method, particularly a diagnostic method, in which a sample from a human (or other mammal) is provided, 
and the amount of BMP is quantitatively measured in an assay. For example, employing anti-BMP 

40 antibodies in a quantitative immunoassay could be used to detect genetic deficiency in BMP. Antibody 
specific for BMP could be formulated into any conventional immunoassay format; e.g., homogeneous or 
heterogeneous, radioimmunoassay or ELISA. The various formats are well known to those skilled in the art. 
See, e.g., "Immunoassay" A Practical Guide" (D.W. Chan and M.T. Perlstein eds. 1987) the disclosure of 
which is incorporated herein by reference. 

45 In general, production of recombinant BMP can provide compositions of that polypeptide substantially 
free of contaminating proteins. The ability to obtain high levels of purity is a result of recombinant 
expression systems which can produce BMP in substantial quantities vis-a-vis in vivo sources. Thus, by 
applying conventional techniques to recombinant cultures, BMP compositions can be produced that are 
substantially more pure than the compositions available from bone sources. 

so Purified BMP will be particularly useful as a tool in the design and screening of cartilage or bone growth 
inhibitors. First, milligram amounts of the material are obtainable according to the present invention. 
Milligram amounts are capable of crystallization to permit three dimensional studies using X-ray diffraction 
and computer analysis. This may permit deduction concerning the shape of th mot cul , thus defining 
proper shapes for substances usable as inhibitors of th activity normally exhibit d by BMP. Generally, 

55 antagonists have been p ptid s wh se interactions with an a polypeptid , th activity of which is inhibited, 
are stabilized by modification of the "r sidues" participating in the peptide bond so as to nhance th 
ability of the peptide to interact specifically with th nzym , receptor, or co-factor, such as osteoinductiv 
associated factors in the case of BMP. Thus the peptid bond joins specifically chosen carboxylic acids and 
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amines (not necessarily amino acids). These peptides ar configured in a three dimensional array so as to 
complement the contours of the intended target. A similar lock and key spatial arrangement may result from 
molecules design d complem ntary to th surfac contours of the BMP of th inv ntion. It is und rsto d 
that "surface" includes convolutions which may face inward, and specifically includes the active site. 
5 Furthermore, "complementary" is understood to mean that, in addition to spatial conformations which "fit", 
interactions between the protein and the molecule which matches its surface contours are attractive and 
positive. These interactions may be hydrogen bonding, ionic or hydrophobic affinity. 

Accordingly, the invention contemplates peptide antagonists and agonists (2-15 amino acids) to BMP 
which are characterized by three dimensional contours complementary to the three dimensional contours on 
io the surface of recombinant BMP. By peptide in this context is meant that the antagonist or agonist contains 
carboxylic acid amide bonds. The carboxylic acid and amine participants need not be a-amino acids. 

Second, even without the assistance of a three dimensional structure determination, purified BMP of the 
invention is useful as a reagent in screening BMP inhibitors in vitro as an ad hoc approach to evaluation. 
Impure BMP preparations currently available yield confusing data due to impurities. For example, con- 
75 taminants which turn out to be themselves inhibitors, activators, or substrates for BMP may interfere with 
the evaluation. Thus, a substantial improvement in current screening techniques for BMP agonists and 
antagonists would be effected by the availability of the purified BMP protein. 

It will be understood that this description and disclosure of the invention is intended to cover all 
changes and modifications of the invention which are within the spirit and scope of the invention. It is within 
20 the knowledge of the art to insert, delete or substitute amino acids within the amino acid sequence of a 
. BMP without substantially affecting the calcification and bone growth inducing activity of the molecule. 
Thus, the invention includes such deletions, additions or substitutions. Furthermore, it is recognized that one 
skilled in the art could recombinants produce such modified proteins. 

The following examples are provided by way of illustration but are not intended to limit the invention in 
25 any way. 

EXAMPLE 1 

Purification of BMP From Bone 

30 

The BMP proteins of interest, partially purified from human (17KD) and bovine (19KD) sources as 
described by Urist, et al., Proc. Nat. Acad. Sci. USA , 81, 371-375 (1984), were further purified to 
homogeneity by preparative gel electrophoresis and electroelution (M.W. Hunkapiller, E. Lujan, F. Ostrander 
and LE. Hood. Methods in Enzymology , 91: 227-236 (1983)). This purification showed that the initial 

35 partially purified samples contained, in addition to the 19KD BMP, other mammalian proteins at 34KD, 
22KD, 14KD and 6KD. After precipitation with acetone (W. H. Konigsberg and L. Henderson, Methods in 
Enzymology , 91: 254-259 (1983)) and quantitation by amino acid analysis (B.A. Bidlingmeyer, SA Cohen 
and T.L Tarvin, Journal of Chromatography , 336 : 93-104 (1984)), the material was reduced under 
denaturing conditions with 2-mercaptoethanol and cysteine residues were derivatized with 4-vinyl-pyridine 

40 (M. Friedman, L.G. Krull and J.F. Cavins, Journal of Biological Chemistry , 245 : 3868-3871 (1970)). After 
exhaustive dialysis to remove the denaturant, protein recovery was assessed by a repetition of amino acid 
analysis. The proteins were digested with TPCK-trypsin in the presence of 2M urea to generate unblocked 
peptide fragments suitable for sequence analysis (C. Allen, Sequencing of Proteins and Peptides , pages 51- 
62 (1981), Elsevier/North Holland Publishing Company, Amsterdam, Holland). Products of the digestion 

45 were resolved by reverse-phase high performance liquid chromatography using gradients of acetonitrile or 
acetonitrile/isopropanol in aqueous trifluoroacetic acid (J.E. Shively, Methods of Protein Microcharac- 
terization , pages 41-87 (1986), Humana Press, Clifton, New Jersey). Peptide fractions were subjected to 
automated Edman degradation using an Applied Biosystems 470A protein sequencer (M.W. Hunkapiller, 
R.M. Hewick. W.J. Dreyer and L.E. Hood, Methods in Enzymology , 91; 399-413 (1983)). The phenyl- 

so thiohydantoin amino acid derivatives were identified by chromatography on an Applied Biosystems 120A 
PTH analyzer (M.W. Hunkapiller, Applied Biosystems , User Bulletin Number 14 (1985), Applied Biosystems, 
Foster City, California). 



55 
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EXAMPLE 2 

RNA Isolation and Probe Synth sis 

5 Cell culture. 293, an embryonic human kidney cell line obtained from the American Type Culture 
Collection (ATCC No. CRL 1573) was cultured in Dulbecco's modified Eagle's medium containing 10% fetal 
calf serum, 100 U/ml penicillin and 100 ug/ml streptomycin at 37 *C in 5% CO2. 

RNA isolation . RNA was isolated from calf liver (obtained from JR Scientific, Woodland, CA) by the 
guanidinium thiocyanate/CsCI method (Maniatis, T. Fritsch, E.F. and Sambrook, J. (1982) Molecular Cloning: 

10 A Laboratory Manual (Cold Spring Harbor Lab., Cold Spring Harbor, NY) and Freeman, G.J., Clayberger, C, 
DeKruyff, R., Rosenblum, D.S. and Cantor, H. (1983) Proc. Natl. Acad. Sci.: USA 80 : 4094-4098). PoIy(A) + 
RNA was purified by a single fractionation over oligo<dT>cellulose (Maniatis, T. Fritsch, E.F. and Sambrook, 
J. (1982) Molecular Cloning: a Laboratory Manual (Cold Spring Harbor Lab., Cold Spring Harbor, NY)). 
Oligonucleotide synthesis . Oligonucleotide adapters, primers and probes were synthesized by the 

75 phosphoramidite method with an Applied Biosystems (Foster City, CA) model 380A synthesizer, purified by 
polyacrylamide gel electrophoresis and desalted on a Waters SEP-PAK (Cis) cartridge. 

(a) Adapters. A 14-mer oligonucleotide (5' CCTGTAGATCTCCG 3*) and a 18-mer oligonucleotide (5' 
AATTCGGAGATCTACAGG 3') were synthesized and used as the EcoRI adapters for the cDNA library 
constructed in lambda ZAPII. The 14-mer was phosphorylated (Maniatis, T. Fritsch, E.F. and Sambrook, 

20 J. (1982) Molecular Cloning: a Laboratory Manual (Cold Spring Harbor Lab., Cold Spring Harbor, NY)) 
and subsequently heated to 95°C for 15 min. to inactivate polynucleotide kinase, prior to annealing with 
the 18-mer. These asymmetrically phosphorylated adapters also contain an internal Bglll restriction 
enzyme site. 

(b) PCR primers. Two 47-mer oligonucleotides were synthesized for PCR amplification of the complete 
25 coding region of human BMP cDNA from kidney cell line 293 mRNA and subsequent cloning into 

lambda ZAPII. The 5* sense primer (5' TCCGGACTCGAGGAATTC AAACAATCATGATTTCCAGAATG- 
GAGAAG 3*) and 3* antisense primer (5' GACTCGAGGAATTCGTCGACACCTCAAGGCCGTTACTCAAA 
GTCAGT 3') are based on the sequence of the human gene exons 1 and 7, respectively. Both primers 
have 5' extensions (adapters) that contain several restriction enzyme sites. 

30 Two additional sets of PCR primers/adapters were synthesized for amplifying bovine and human 

BMP cDNA clones (encoding only the mature protein) and subsequent cloning into pAB125, a plasmid 
containing the yeast a-f actor leader sequence. The 5' sense primer (42-mer) for the human BMP cDNA 
(5' ACTACTGGT CTAGATAAAAGATTCCCAGTGTACGACTACGAT 3 1 ) and the 5' sense primer (41-mer) 
for the bovine BMP cDNA (5* GGAACCCTCTCTAGATAAAAGATTCCCGGTGTATGACTATG 3') contain 

35 an Xbal site to facilitate ligation of the PCR amplified cDNA to the a-f actor leader sequence. The 3' 
antisense primer (37-mer) for the human BMP cDNA (5* TCTTACCTGTCGACTATTACTCAAAGTCAG- 
TATTTAT 3') and the 3' antisense primer (41-mer) for the bovine BMP cDNA (5* AATGGCCGTCGAC- 
TATCACTCAAAGCCAGGGTTTACTCTTG 3') contain a Sal! site directly after the stop codon. 

(c) Probes. Oligonucleotide probes, based on the tryptic peptide sequences of bovine BMP, were 
40 synthesized for screening the bovine genomic library (Probes A-D) and are shown in Figure 1. The actual 

probes are complementary to the sequences shown. The sequences of four additional BMP tryptic 
peptides were determined (E-H) and are also shown in Fig. 1. With some codons, two nucleotides were 
included at degenerate positions to increase the probability of a correct guess. At serine residues, two 
separate oligonucleotides for each probe (differing only at the serine codon) had to be synthesized due 
45 to the sunique nature of the serine codon degeneracy. 

Two partially overlapping 50-mer oligonucleotides, based on the bovine BMP exon 3 sequence, were 
synthesized for screening the calf liver cDNA library (Probe I) and are shown in Fig. 2a (underlined). Four 
oligonucleotide probes were synthesized to identify exons 1, 5, 6, and 7 of the human BMP gene (see 
Table .1). Probe BMP103 (5' CCAGTGGTTCATTCCAAGGA CAAATATCACCAACATCTTCATCGCCATCTTC- 
50 TCCAT 3*) is a 57-mer complementary to bovine BMP exon 1 coding sequences. Probe BMP104 (5' 
TCACTCAAAGCCAGGGTTTACTCTTGCTCTGGGCC ACGGGTTCGAGTACCTTCTATT 3') is also a 57-mer 
complementary to bovine BMP exon 7 coding sequences. BMP 116 (5' GAGACCAAATAGATAATTGTTTC- 
TCCATTTATGAGATCC 3') is a 39-mer complementary to human BMP exon 5 coding sequences. BMP 117 
(5' CAAGTGACCGATCATAAAA TTGTTCACTTATGGACTCGTCTGAAATGAGA 3') is a 51-mer complemen- 
55 tary to human BMP exon 6 coding sequences. A 26-mer oligonucleotid (5' TAGTCCCTCTGGAAGG- 
CACATGTAGC 3') compl mentary to human BMP exon 3 coding sequences (Probe J) was synthesized for 
sere ning the human BMP/PCR cDNA library. 
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EXAMPLE 3 

Construction and Screening of cPNA Libraries 

5 Construction of the cPNA libraries, (a) Calf liver cDNA library. First strand cDNA was synthesized from 
calf liver poly(A) + RNA using conditions similar to Okayama and Berg [Okayama, H, and Berg, P. Molec. 
and Cell Biol . 3:4094-4098 (1983)]. About 5 ug of poly(A) + RNA in 20 ul 5 mM Tris-hydrochloride (pH 7.5) 
was heated to 65 *C for 3 min., then quick cooled on wet ice and immediately adjusted (at room 
temperature) to contain 50 mM Tris-hydrochloride (pH 8.3 at 42 *C), 8 mM MgCb 30 mM KCI, 10 mM 

w dithiothreitol, 2 mM each of dATP, dGTP, dTTP and o-^PJdCTP^SOOcpm/pmol). 60 U RNasin, and 2.5 ug 
of oligo (dT)i 2 -i8 (total volume 40 ml). The reaction was initiated by the addition of 50-60 U of cloned 
moloney murine leukemia virus reverse transcriptase and continued for 60 min. at 42 ° C. The second cDNA 
strand was synthesized by the method of Gubler and Hoffman [Gubler, U. and Hoffman. B.J. Gene 25:263- 
269 (1983)] as modified by Aruffo and Seed [Aruffo, A. and Seed. B. Proc. Natl. Acad. Sci.: USA 74:8573- 

75 8577 (1987)]. The ds cDNA was then ligated to asymmetrically (hemi) phosphorylated EcoRI adapters (see 
oligonucleotide synthesis) as described by Aruffo and Seed, supra , phosphorylated with T* polynucleotide 
kinase (Maniatis, et al., supra ), adjusted to 0.5 M NaCI, 25 mM EDTA and heated at 75*C for 15 min. to 
inactivate the polynucleotide kinase. The ds cDNA was separated from unligated adapters by chromatog- 
raphy on Biogel A-15 m and recovered by ethanol precipitation. cDNA was ligated to EcoRI-cut lambda 

20 ZAPII (Stratagene) with T* DNA ligase (New England Biolabs) as described by supplier, but included 1 5% 
polyethylene glycol (PEG) 8000 (Sigma), a modification described by Pheiffer and Zimmerman [Pheiffer, 
B.H. and Zimmerman. S.B. Nucl. Acids. Res . 11;7853-7871 (1983)]. The ligated DNA was recovered by 
centrifugation (12,000 xg), washed with chloroform, dried, resuspended in 4 ul water and incubated with an 
in vitro packaging extract (Stratagene) according to supplier. Recombinant phage was propagated in E. coli 

25 XLI-Blue (Stratagene). 

(b) BMP primed-PCR amplified (BMP/PCR) human kidney cDNA library. The PCR reactions were 
performed as described by the suppliers of the PCR kit (Perkin/Elmer/Cetus). Two synthetic 47-mer 
oligonucleotide primers whose sequences were derived from exon 1 (sense primer) and 7 (antisense 
primer) of the human BMP gene and contained restriction site adapters suitable for cloning were used at a 

30 final concentration of 1 uM each. The PCR primers flank the complete coding region of hBMP mRNA. The 
template cDNA was synthesized from 2.5 ug of embryonic human kidney cell line 293 poly(A) + RNA. The 
conditions of cDNA synthesis were identical to the above (Part A) except that the reaction volume was 20 
Ul. The cDNA was fractionated on Biogel A-15m, also as above, recovered by ethanol precipitation and 
resuspended in 100 ul of sterile water. 1-10 ul of cDNA template were used for each PCR reaction. 40 

35 cycles of PCR were performed in a Perkin/Elmer/Cetus DNA thermal cycler. Each cycle consisted of a 
94 °C, 1 min. denaturation step, a 55 *C, 2 min. annealing step and a 72 °C, 3 min. extension step. The 
extension step in cycle 40 was 15 min. instead of 3 min. Samples were extracted once with phenol/ 
chloroform/IAA (1:1:0.04) once with chloroform/IAA (24:1), recovered by ethanol precipitation, digested with 
EcoRI and fractionated by electrophoresis on a 7% acrylamide, 1x TBE gel. DNA migrating between 400- 

40 800 b.p. was excised from the gel, purified by passage over an Elutip-d column, ligated to Eco-RI cut 
lambda ZAPII, packaged and propagated as above (Part A). 

Screening of the libraries , (a) Bovine genomic library. Approximately 10 6 recombinant phage from a 
bovine genomic library (Clontech) were plated (20,000 phage/137 mm dia plate) in E. coli LE392, and grown 
for 5-6 hours at 37 *C. The phage were transferred onto nitorcellulose filters (Millipore, HATF137) processed 

45 according to Benton and Davis (8) and screened with probe A (Rg. 1). The probes were end-labeled with T* 
polynucleotide kinase and ( i32 -P) ATP (1) to a specific activity of 1-2 x 10 8 cmp/ug. The filters were 
prehybridized for 1-2 h at 37* C in 5x SSC (1x SSC = 0.15 M sodium chloride/0.015 M sodium citrate, pH 
7), 5x Denhardt's solution (1x Denhardt's solution = 0.02% polyvinylpyrrolidone/0.02% Rcoll/0.02% bovine 
serum albumin), 10% dextran sulfate, 50 mM sodium phosphate pH 6.8, 1 mM sodium pyrophosphate, 

so 0.1% NaDodSO* and 50 ug/ml denatured salmon sperm DNA. Labeled probe was added to a concentration 
of to 6 cpm/ml and hybridization was continued overnight at 37 *C with gentle shaking. The filters were 
washed in 2xSSC, 0.1% NaDodSO* at 55 *C, and exposed to Kodak XAR-2 film with a DuPont Lightning 
Plus intensifying screen vemight at -80 * C. After d velopm nt, the probe was remov d from the filters by 
washing in O.lxSSC, 0.1% NaDodSO* at 65 *C. One set of filters was then hybridized with probe B (Fig. 1) 

55 and washed and expos d as above. Ar as of plaqu s giving signals with probes A and B wer picked, 
replat d, transferr d onto nitroc llulose in quadruplicate, amplified according to Woo (21) and scr ened with 
probes A-D (Rg. 1) (one probe p r filt r). Rlters w r washed and xposed to film as above. A plaqu 
giving signals with three (A-C) of the four probes was purifi d by an additional round of plating and 
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screening. 

(b) Caff liver cDNA library. Approximately 192.000 recombinant phage were plated (16.000 phage/137 
mm dia plate) in E. coli XLI-Blue, processed as above and screened with E. coli probe I (Fig. 2a). The 
probe was labeled with DNA polymerase I (Klenow fragment) and (a^-PJ-d CTP (9) to a specific activity of 

5 2 x 10 9 cpm/ug. The filters were screened as above (a) but with the following changes. (1) The 
hybridization solution contained 40% formamide and (2) the filters were washed in 2xSSC, 0.1% NaDodSCU 
at65°C. 

(c) Human genomic library. Approximately 10* recombinant phage from a human genomic library 
(Stratagene) were plated (50,000 phage/plate) in E. coli LE392, processed and screened with the 780bp 

10 Bglll insert of bBMP cDNA#1 (Fig. 3). The probe was labeled (9) and the filters were hybridized as above 
(see calf liver cDNA library). The filters were washed in 2xSSC, 0.1% NaDodSO* at 60 * C. Positive plaques 
were purified by replating and rescreening. 

(d) (BMP/PCR) human kidney cDNA library. Approximately 1000 recombinant phage were plated (500 
mm dia plate) in E. coli. XL1 Blue, processed, hybridized with probe J (Fig. 4b, underlined) and washed as 

75 in (a). Positive plaques were purified by replating and rescreening. 

EXAMPLE 4 

Plasmid and Phage DNA Isolation, Subcloning, Sequencing and Analysis 

20 

Plasmid DNA was isolated by the alkaline lysis method (Maniatis, et a}., supra ) and lambda DNA was 
isolated by a phage miniprep procedure described by Jones and Rawls, [Jones, K.W. and Rawls, J.M 
Genetics 120:733-742 (1988)]. 

Bluescript SK(-) plasmids containing BMP cDNA were released from lambda ZAP by the M13 

25 rescue/excision protocol described by the supplier (Stratagene). BMP gene fragments were released from 
the EMBL3 lambda vector by sail or other appropriate restriction enzyme digestions (see Fig. 2 and Table 
1). BMP cDNA inserts were excised frorh the Bluescript SK(-) vector by a Bglll (bBMP cDNA) or a 
BamHt/Hindlll (hBMP cDNA) digestion. The DNA fragments were purified by polyacrylamide or agarose gel 
electrophoresis (Maniatis, et al., supra ) and passage over an Elutip-d column (Schleicher and Schuell) and 

30 were then subcloned into pUC 19 or M13 sequencing vectors [Yamisch-Perron, C, Vieira, J. and Messing, 
J., Gene 33 :103-119 (1985)]. DNA sequencing was performed by the dideoxy chain termination method 
(Sanger, F., Nicklen, S. and Coulson, A.R., Proc. Natl. Acad. Sci. USA 74:5463-67 (1977)] using M13 
primers as well as specific internal primers. Ambiguous regions were resolved using 7-deaza-2-deox- 
yguanidine-triphosphate [Barr, P.J., Thayer, R.M., Laybourn, P., Najarian, R.C., Seela, F., and Tolan, D., 

35 Biotechniques 4:428-32 (1986)] and sequenase (U.S. Biochemicals). 

Northern blot analysis . Poly(A) + RNA was fractionated on a 1.4% agarose gel in the presence of 
formaldehyde (Lehrach, H. f Diamond, D. f Wozney, J.M. and Boedtker, H., Biochemistry 16:4743-51 (1977)] 
and directly transferred to nitrocellulose according to Thomas (Thomas, P., Proc. Natl. Acad. Sci. USA 
77:5201-5 (1980)]. 

40 Filters were hybridized with probe I or bBMP cDNA #1 and washed as previously described above in 

screening of the libraries, section B. 

Southern blot analysis . Genomic blots: 10 ug of human, bovine and mouse genomic DNA (Clontech) 

was digested with EcoRI, fractionated on a 0.7% agarose gel and transferred to nitrocellulose (Maniatis, et 

al., supra ). Hybridization and washing were identical to those described under "Northern blot analysis." 
45 Clone and PCR blots: DNA from genomic clones, cDNA clones or PCR reactions were digested with various 

restriction enzymes, fractionated on 1% agarose gels and transferred to nitrocellulose (Maniatis, et al., 

supra ). 

EXAMPLE 5 

50 

Expression of Bovine and Human B MPs in Yeast 

Th PCR prim rs d scribed in detail above were used to g nerat DNA sequences with Xba-1 and Sal- 
1 restriction sites for direct in-frame cloning into pAB125, a vector containing the a-factor leader (describ d 
55 in EPO Pub. No. 0 116 201. published 22 August 1984) sequence fused to the ADH2/GAPDH promot r 
(describ d in a copending U.S. Patent Application Ser. No. 760,197, filed 29 July 1985). The resulting 
promoter/leader/gen cass ttes wer xcised with Bam Hi and Sal-I and cloned into th yeast xpression 
vector PBS24.1. The plasmid PBS24.1 is a d rivativ a pBS24 which is d scribed in copending U.S. Pat nt 
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Application S r. No. 138,894, filed 24 December 1987. The BamHI-Sall v ctor fragment of pBS24 was 
ligated to a 65 base pair human basic FGF BamHI-Sall fragment to create pBS24.1 . This 65 base pair 
fragment will be replaced by the BamHI-Sall promoter/leader/BMP cassettes which are ligated into pBS24.1 
The resulting plasmids pBS24bBMP and pBS24hBMP were used to transform yeast strain AB110 with 
5 genotype Mata, ura3-52, leu2-04 or both leu2-3 and Ieu2-112 t pep4-3. his4-580 l cir«, under leucine 
selection media, such as the synthetic complete media without leucine in Methods in Yeast Genetics , Cold 
Spring Harbor Laboratories, 1986. F. Sherman, G.R. Fink and J.B. Hicks. For induction of expression, cells 
were grown for 48h the uracil deficient media below: 

20 g Casamino Acids 
10 5 g Ammonium Sulfate 

1 g Potassium Phosphate 

0.5g Magnesium Sulfate 

0.1 g Sodium Chloride 

0.1 g Calcium Chloride 
is 0.04ml Trace Elements Mixture 

41 1 2% Sodium Molybdate 

0.01 5g Vitamin Mix 

0.03 g Pantothenate 

6.03 g Inositol 
20 70 ml 50% Glucose 

q.s. to a 900ml and pH to 6.0 

q.s to 1000 ml 

Trace Elements Mixture 

25 

5 g Boric Acid 
0.4g Cupric Sulfate 

1 g Potassium Iodide 

2 g Ferric-Chloride 

30 4 g Manganese Sulfate 

2 g Sodium Molybdate 
4 g Zinc Sulfate 

q.s. to 1000ml and sterilize by filtration. 
35 Vitamin Mix 

3 g Myo-lnositol 
3 g Thiamine 

3 g Pyridoxine 
40 3 g Calcium Pantothenate 
0.2g Biotin 

2 g p-Aminobenzoic Acid 
2 g Riboflavin 
0.2g Folic acid 
45 3 g Niacin 

Yeast cells were removed from the culture media by centrifugation and the proteins in the supernatants 
analyzed after precipitation with 10% trichloroacetic acid/deoxycholate (0.4 mg/ml). The pellets were 
washed with acetone and loaded onto 15% SDS-polyacrylamide gels together with appropriate controls. 
Proteins were visualized by Coomassie blue staining. A samples of S. cerevisiae strain AB110 containing 
. so pBS24.1bBMP or pBS24.1hBMP were deposited with the ATCC on 1 June 1989 under Accession Nos. 
20949 and 20950. 

EXAMPLE 6 

55 Recombinant Bovin BMP Purification 

Medium was r moved from yeast cells by centrifugation, and concentrated approximat ly t n fold. Th 
pH was adjusted to 4.5 and th cone ntrat diiut d to a conductivity below 5 mS/cm. This was applied to 
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Fast Flow S ion-exchang resin (Pharmacia) pre-equilibrated with 50 mM sodium acetate, 1 mM EDTA, 1 
mM PMSF (pH 4.5). The column was washed with one volume of the above buffer and eluted using a 0-1 M 
sodium chloride gradient in the above buffer. The 19KD and 16KD proteins eluted at a conductivity of 5-25 
mS/cm. as confirmed by SDS-PAGE. The fractions containing predominantly the 19KD protein were pooled, 
5 adjusted to pH 7.5 and 4.5 M with respect to urea, concentrated, and applied to an S-100 sizing column in 
4M urea, 100 mM Tris/HCI, 1 mM EDTA, 1 mM PMSF (pH 7.5). Fractions containing 19KD and 16KD 
proteins were identified by SOS-PAGE, pooled separately, concentrated and dialyzed against water, prior to 
lyophilization. 

70 EXAMPLE 7 

ASSAY FOR BMP ACTIVITY 

Samples of purified recombinant BMP were added to CMRL-1066 (GIBCO) culture medium containing 
75 fetal rat midbelly triceps brachii muscle fragments. Connective tissue outgrowths were cultured on a 
substratum of BMP-free matrix from the diaphyses of long bones of adult male Sprague-Dawley rats. This 
assay is detailed more fully by Kawamura and Urist, Developmental Biology, 130, 435-442 (1982). The 
inductive activity is measured by pHJthymidine incorporation into DNA, psjsulfate incorporation into 
glycosaminoglycans and confirmation of chondrogenesis (cartilage formation) by histology. Recombinant 
20 BMP was tested at concentrations of 200 ng/ml to 5 g/ml in the 2 ml culture system. The same level of 
positive results was observed with recombinant BMP as was previously noted with native BMP at equivalent 
concentrations. Higher [ 3 H]thymidine and PSJsuifate incorporation and new cartilage and chondrocytes 
were observed in the recombinant BMP induced cultures. No cartilage or chondrocytes were seen in control 
cultures without recombinant BMP. 

25 

EXAMPLE 8 

Isolation and Analysis of the BMP gene and cDNA 

30 I. The Bovine BMP gene. 

Probes A and B (Fig. 1) were used to isolate 1 strongly and 7 weakly hybridizing clones from 10 s 
recombinants of a bovine genomic library. The one strongly hybridizing clone (29-bg-3) was subjected to a 
second round of screening with probes A-D (Fig. 1). Three of the probes (A-C) hybridized to 29-bg-3. 

35 Southern blot analysis of purified 29-bg-3 DNA localized probes A and B to a 1 .9kb EcoRI fragment and 
probe C to a 1.2kb EcoRI-Sall fragment Both fragments were sequenced and shown to contain regions, in 
the top reading frame, encoding their respective tryptic peptides (Fig. 2; boxed amino acids). Interestingly, 
tryptic peptide D was also present but was split between two exons, thus explaining the lack of hybridization 
with probe D. Intron-Exon boundaries conform to the GT-AG rule and are denoted by vertical lines. 

ao Additional bovine BMP exons were not isolated. Attempts were directed at isolating a bovine BMP cDNA 
clone using as a probe exon sequences derived from the bovine BMP gene. 

II. The Bovine BMP cDNA 

45 Northern blot analysis of poly(A) + RNA from several bovine tissues using probe I revealed calf liver as a 
good source of bovine BMP mRNA (data not shown). Probe I was then used to isolate putative bovine BMP 
cDNA clones from a calf liver cDNA library. Two clones (#1 and #7) were sequenced, and shown to contain 
identical overlapping sequences as well as the expected encoded tryptic peptides (A-D) (Fig. 3). The 
additional BMP tryptic peptides (E-H, Fig. 1) were also found encoded in the bovine BMP cDNA. 

50 

HI. The Human BMP gene 

North rn blot analysis of poly(A) + RNA from s v ral human tissues and c II lines, including liver, kidney 
and osteosarcoma, (using bovin BMP cDNA #1 as a prob ) failed to d tect human BMP. However, a 
55 Southern blot, performed und r identical hybridization and wash conditions detected human BMP gene 
fragments (data not shown). 

Based on th North rn and Southern results, the following strategy was adopted to (1) clon and 
sequence th human BMP gen , then (2) identify a tissu source of human BMP mRNA by PCR 
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amplification (using human BMP based primers: BMP/PCR cDNA) and Southern blot analysis of th 
products 3) clone and isolate the PCR gen rated human BMP cDNA. 

Bovine BMP cDNA #1 was used as a probe to isolate 7 strongly and 5 weakly hybridizing clones from 
10* recombinants of a human genomic library. Southern blot analysis of purified DNA from 11 of the 12 

5 clones identified two hybridizing Hindlll fragments (1.7 kb and 2.0 kb) common to bee clones HG4, 5 and 9. 
The 1.7 kb and 2.0 kb Hindlll fragments from HG9 were sequenced and shown to have a 63.4% and 62.2% 
amino acid homology to the bovine BMP exons shown on Figure 2A and 2B respectively. Table 1 
summarizes these results as well as the Southern blot results for the remaining human BMP exons. The 
sequence of the exort-containing region of each subclone is shown in Figure 4a-c. The 5' end of exon 1 

ro (Cap site) and the 3* end of exon 7 (poly(A) + addition site) are unknown. The intron/exon boundaries follow 
the GT-AG rule and are denoted by vertical lines. The DNA was translated in all three reading frames. The 
middle and top frames contain the correct amino acids for exons 1-6 and exon 7, respectively (boxed). 

IV. The Human BMP cDNA 

75 

Kidney (cell line, 293) was identified as a source of hBMP mRNA by PCR amplification and Southern 
blot analysis. 10 3 recombinants of a cDNA library made from the hBMP/PCR cDNA, were screened with 
probe J (human exon 3 probe) and 12 putative hBMP cDNA clones were isolated. Agarose gel elec- 
trophoresis following a BamHI/Hindlll digestion of the clones showed that each clone contained either a 
20 700bp or a 600bp cDNA insert. DNA sequencing revealed that the 700bp species was te expected full 
length human BMP cDNA (Fig. 5) while the 600bp species was a truncated human BMP cDNA, missing 
exon 2 (not shown). This shorter cDNA species most likely represents an aberrantly spliced, non-functional 
mRNA since a translational frameshift occurs at the newly formed exon 1/exon 3 junction resulting in a 
premature termination codon. 

25 

EXAMPLE 9 

Analysis of Human and Bovine BMPs Expressed in Yeast 

30 hBMP is expressed in yeast as a mixture of three approximately 17KD proteins, whereas bBMP is 
expressed as a mixture of 3 species with molecular weights of 19KD and a 16KD doublet. The size 
heterogeneity is most probably a result of processing of each BMP by yeast encoded enzyme(s) during 
secretion. Amino-terminal amino acid sequencing of the bBMP mixture gave a single amino terminus (Phe), 
indicating that the processing occurs in the carboxyl region of the protein, yielding the 16KD yeast cleavage 

35 analogs. Several paired basic amino acid residues in this region are likely candidates as proteolysis sites 
during secretion. This observation, together with the fact that each BMP cDNA encodes a putative protein 
product of molecular weight greater than 20kD, also underscores the possibility that BMP, in vivo , is the 
product of proteolytic processing from a larger precursor. 

40 



45 



50 



55 
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TABLE 1 



A Summary of the South rn Blot Results Id ntifying Human BMP Ex n-Containing DNA from Genomic 

Clones. 


Exon 


Restriction Fragment 1 


Genomic Clone 2 


Probe 4 


Size (kb) 


Enzyme 


1 


0.6 


Pstl 


Hg4 3 


BMP 103 


2 


0.6 


Pstl 


HG 4 3 


hBMP cDNA 


3 


1.7 


Hind III 


HG9 


bBMP cDNA 


4 


2.0 


Hind III 


HG 9 


bBMP cDNA 


5 


0.4 


Bgl II 


HG 5 


BMP 116 


6 


3.0 


Bgl II 


HG 5 3 


BMP 117 


7 


1.1 


Bgl II 


HG 5 


BMP 104 



1 Exon-containing DNA restriction fragment. 

2 Genomic clone from which restriction fragment was derived. 

3 Exon 1 and 2 DNA's were obtained from a 7 kb Sal I fragment of HG 4 subcloned into pUC 1 9. 
Both exons were found on the same Pst I fragment, but identified with different probes. Exon 6 was 
obtained from a 15kb Sal I insert of HG5 subcloned into pUC19. 

♦ Probe used to identify exon-containing DNA restriction fragment. Probes are described in 
"Oligonucleotide Synthesis" section of Materials and Methods. Hybridization and washing 
conditions are described under Example 3 in the "Screening of the libraries (c) human genomic 
library" section of the examples. 



Claims 

30 1. A DNA fragment comprising a first nucleotide sequence that encodes a polypeptide or a portion 
thereof, wherein the polypeptide comprises an amino acid sequence selected from the group consisting 
of a first amino acid sequence as shown in Rgure 3, comprising amino acid residues numbered 1 
through 180, and a second amino acid sequence as shown in Rgure 5, comprising amino acid residues 
numbered 1 through 182, and wherein the portion possesses a biological activity of the polypeptide. 

35 

2. The DNA fragment according to claim 1 , wherein the amino acid sequence of the polypeptide is the 
second amino acid sequence. 

3. The DNA fragment according to claim 1 , wherein the amino acid sequence of the polypeptide is the 
40 first amino acid sequence. 

4. The DNA fragment according to any one of claims 1 to 3, further comprising a second nucleotide 
sequence that encodes a signal sequence for the first or second amino acid sequence. 

45 5. The DNA fragment according to claim 4 further comprising a second nucleotide sequence that encodes 
an N-terminal yeast alpha-factor signal sequence. 

6. A vector comprising a DNA fragment according to any one of claims 1 to 5. 

so 7. A recombinant host cell comprising a vector according to claim 6. 

8. A method of producing recombinant mammalian bone morphogenetic protein comprising: 

(a) providing a cell that comprises a first DNA sequenc , wherein said DNA sequenc comprises (i) 
transcriptional and translational control sequences functional in said cell, and (ii) a het rologous 
55 coding s quence under the control of said transcriptional and translational sequenc s, wherein th 

heterologous coding sequ nc encodes a polypeptide or a portion th reof, th polypeptide compris- 
ing an amino acid sequence selected from th group consisting of a first amino acid sequence as 
shown in Figure 3, comprising amino acid residues numbered 1 through 180, and a second amino 
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acid sequence as shown in Figure 5, comprising amino acid residues numbered 1 through 182. and 
wherein the portion possesses a biological activity of th polypeptide; and 
(b) growing said cell under c nditions wh r by said polyp ptid is expressed. 

5 9. The method according to claim 8, wherein the amino acid sequence of the polypeptide is the second 
amino acid sequence. 

10. The method according to claim 8. wherein the amino acid sequence of the polypeptide is the first 
amino acid sequence. 

to 

11. The method according to any one of claims 8 to 10, wherein the cell is a microorganism. 

12. The method according to claim 11, wherein the microorganism is a yeast cell. 

is 13b The method according to any one of claims 8 to 12, further comprising a second DNA sequence that 
encodes a signal sequence wherein the signal sequence is disposed at the N-terminal of said 
polypeptide. 

14. The method of claim 13, wherein the cell is a yeast cell and said signal sequence is a yeast alpha- 
20 factor signal sequence. 

15. The method according to claim 11 wherein the microorganism is an E. coli cell. 

16. A DNA fragment comprising a nucleotide sequence that encodes a peptide selected from the group 
25 consisting of peptides A, B, C, D, E, F, G, and H of Figure 1. 

17. A DNA fragment comprising a nucleotide sequence selected from the group consisting of Exon 3 of 
Figure 2A and Exon 4 of Figure 2B. 

30 18. A DNA fragment comprising a nucleotide sequence selected from the group consisting of Exons 1 
through 7 of Figures 4a, 4b, and 4c. 

19. A composition comprising mammalian bone morphogenetic protein, or a portion thereof, substantially 
free of other osteoinductive associated factors, wherein the protein comprises an amino acid sequence 
35 selected from the group consisting of a first amino acid sequence as shown in Rgure 3, comprising 
amino acid residues numbered 1 through 180, and a second amino acid sequence as shown in Figure 
5, comprising amino acid residues numbered 1 through 182, and wherein the portion possesses a 
biological activity of the protein. 

40 Patentanspriiche 

1. DNA-Fragment, umfassend eine erste Nucleotidsequenz, die ein Polypeptid Oder einen Teil davon 
codiert, wobei das Polypeptid eine Aminosauresequenz umfaBt, die ausgewahlt ist aus einer ersten 
Aminosauresequenz gemSB Figur 3, umfassend AmtnosSurereste 1 bis 180, und einer zweiten 

45 AminosSuresequenz gemafi Figur 5, umfassend Aminosaurereste 1 bis 182, und wobei der Teil eine 
biologische Aktivitat des Polypeptids besitzt. 

2. DNA-Fragment nach Anspruch 1 , wobei die Aminosauresequenz des Polypeptids die zweite Aminosau- 
resequenz ist. 

60 

3. DNA-Fragment nach Anspruch 1 , wobei die Aminosauresequenz des Polypeptids die erste Aminosau- 
resequenz ist. 

4. DNA-Fragment nach ein m der Anspruch 1 bis 3, auBerdem eine zw ite Nucleotidsequenz umfassend, 
55 die ein Signals qu nz fur di rste od r zweit Aminosa'ures qu nz codiert. 

5. DNA-Fragment nach Anspruch 4, auBerdem eine zweite Nucl otidsequenz umfassend, die eine N- 
terminale Hef e- Alpha- Faktor-Signalsequenz codiert. 
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6. Vektor. umfass nd in DNA-Fragm nt nach einem der AnsprOch 1 bis 5. 

7. Rekombinante Wirtsz He, umfassend einen Vektor nach Anspruch 6. 

5 8. Verfahren zur Herstellung eines rekombinanten morphogenetischen Knochenproteins vom Sauger, 
umfassend: 

(a) Bereitstellung einer Zelle, umfassend eine erste DNA-Sequenz, wobei die DNA-Sequenz (i) 
Transkriptions- und Translationskontrollsequenzen, die in der Zelle wirksam sind, und (ii) eine 
heterologe codierende Sequenz unter der Kontrolle der Transkriptions- und Translationssequenzen 

10 umfaBt, wobei die heterologe codierende Sequenz ein Polypeptid Oder einen Teil davon codiert, 

wobei das Polypeptid eine Aminosauresequenz umfaBt, die ausgewahlt ist aus einer ersten Amino- 
sauresequenz gemaB Rgur 3, umfassend Aminosaurereste 1 bis 180, und einer zweiten Aminosau- 
resequenz gemaB Rgur 5, umfassend Aminosaurereste 1 bis 182, und wobei der Teil eine 
biologische Aktivitat des Polypeptids besitzt; und 

is (b) ZUchtung der Zelle unter Bedingungen, unter denen das Polypeptid exprimiert wird. 

9. Verfahren nach Anspruch 8, wobei die Aminosauresequenz des Polypeptids die zweite Aminosaurese- 
quenz ist. 

20 10. Verfahren nach Anspruch 8, wobei die Aminosauresequenz des Polypeptids die erste Aminosaurese- 
quenz ist. 

11. Verfahren nach einem der Ansprtlche 8 bis 10, wobei die Zelle ein Mikroorganismus ist. 
25 12. Verfahren nach Anspruch 11, wobei der Mikroorganismus eine Hefezelle ist. 

13. Verfahren nach einem der Ansprtlche 8 bis 12, auBerdem eine zweite DNA-Sequenz umfassend, die 
eine Signalsequenz codiert, wobei sich die Signalsequenz am N-terminalen Ende des Polypeptids 
befindet. 

30 

14. Verfahren nach Anspruch 13, wobei die Zelle eine Hefezelle ist und die Signalsequenz eine Hefe-Alpha- 
Faktor-Signalsequenz. 

15. Verfahren nach Anspruch 11, wobei der Mikroorganismus eine E. coli-Zelle ist. 

05 

16. DNA-Fragment, umfassend eine Nucleotidsequenz, die ein Peptid codiert, das ausgewShlt ist aus den 
Peptiden A, B, C, D, E, F, G und H der Rgur 1. 

17. DNA-Fragment, umfassend eine Nucleotidsequenz, die ausgewahlt ist aus Exon 3 von Rgur 2A und 
40 Exon 4 von Rgur 2B. 

18. DNA-Fragment, umfassend eine Nucleotidsequenz, die ausgewahlt ist aus den Exons 1 bis 7 der 
Rguren 4a, 4b und 4c. 

45 19. Mittel, umfassend ein morphogenetisches Knochenprotein vom Sauger oder einen Teil davon, im 
wesentlichen frei von anderen mit Osteoinduktion im Zusammenhang stehenden Faktoren, wobei das 
Protein eine Aminosauresequenz umfaBt, die ausgewahlt ist aus einer ersten Aminosauresequenz 
gemaB Rgur 3, umfassend Aminosaurereste 1 bis 180, und einer zweiten Aminosauresequenz gemaB 
Rgur 5, umfassend Aminosaurereste 1 bis 182, und wobei der Teil eine biologische Aktivitat des 

so Proteins besitzt. 

Revendications 

1. Fragment d'ADN compr nant un premiere se*qu nee de nucl£otid s codant pour un polypeptide ou . 
55 une portion d polypeptid , dans lequ I I polypeptide comprend un s£quenc d'acides amines 
choisie dans le groupe consistant en une premiere sequence d'acides amines telle que r pr£sent£e sur 
la figure 3, comprenant des r£sidus d'acides amines num£rot£s 1 k 180, et un seconde sequence 
d'acides amines telle que representee sur la figur 5, comprenant des r£sidus d'acid s amines 
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numerates 1 & 182, tou la portion possede un activity biologiqu du polypeptide. 

2. Fragment d'ADN selon la rev ndication 1, dans lequel la sequence d'acides amines du polypeptide est 
la seconds sequence d'acides amines. 

5 

3. Fragment d'ADN selon la revendication 1 , dans lequel la sequence d'acides amines du polypeptide est 
la premiere sequence d'acides amines. 

4. Fragment d'ADN selon I'une quelconque des revendications 1 a 3, comprenant en outre une seconds 
10 sequence de nucleotides codant pour une sequence signal pour la premiere ou la seconds sequences 

d'acides amines. 

5. Fragment d'ADN selon la revendication 4, comprenant en outre une seconde sequence de nucleotides 
codant pour une sequence signal de facteur alpha de levure N-terminale. 

75 

6. Vecteur comprenant un fragment d'ADN selon Tune quelconque des revendications 1 a 5. 

7. Cellule note recombin^e comprenant un vecteur selon la revendication 6. 

20 8. Procede pour la production de proteins morphog6n£tique d'origine osseuse de mammifere recombin^e 
comprenant: 

(a) la miss a disposition d'une cellule comprenant une premiere sequence d'ADN, dans laquelle 
ladite sequence d'ADN comprend (i) des sequences de contrdte transcriptionnelles et traductionnel- 
les fonctionnelles dans les dites cellules, et (ii) une sequence codante necrologue sous te controls 

25 des dites sequences transcriptionnelles et traductionnelles, la sequence codante heterologue codant 

pour un polypeptide ou une portion de celui-ci, le polypeptide comprenant une sequence d'acides 
amines choisis dans le groupe consistant en une premiere sequence d'acides amines telle que 
representee dans la figure 3, comprenant des r£sidus d'acides amines numerous 1 a 180, et une 
seconde sequence d'acides amines telle que representee dans |a figure 5, comprenant des nisidus 

30 d'acides amines numerates 1 a 182, et ou la portion possede une activite biologique du polypeptide; 

et 

(b) la culture de ladite cellule dans des conditions dans lesquelles ledit polypeptide est exprime\ 

. 9. Procede selon la revendication 8, dans lequel la sequence d'acides amines du polypeptide est la 
35 seconde sequence d'acides amines. 

10. Procede selon la revendication 8, dans lequel la sequence d'acides amines du polypeptide est la 
premiere sequence d'acides amines. 

40 11. Procede selon Tune quelconque des revendications 8 a 10, dans lequel la cellule est un microorgani- 
sms 

12. Procede selon I'une quelconque des revendications 8 a 10, dans lequel le microorganisms est une 
cellule de levure. 

45 

13. Procede selon Tune quelconque des revendications 8 a 12, comprenant en outre une seconde 
sequence d'ADN codant pour une sequence signal dans laquelle la sequence signal est situ^e sur 
I'extremite N-terminale du dit polypeptide. 

50 14. Procede selon la revendication 13, dans lequel la cellule est une cellule de levure et ladite sequence 
signal est une sequence signal de facteur alpha de levure. 

15. Procede selon la r v ndication 11, dans lequel I microorganism st un cellule d'E. colL 

55 16. Fragment d'ADN compr nant une s£qu nc d nucleotides codant pour un peptide choisi dans le 
groupe consistant en les peptides A, B, C, D, E, F, 6 et H selon la figure 1 . 
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17. Fragment d'ADN, comprenant une s£quenc de nucleotides choisie dans I groupe consistant en Exon 
3 selon la figure 2A t Exon 4 selon la figure 2B. 

18. Fragment d'ADN comprenant une sequence de nucleotides choisie dans le groupe consistant en Exon 
1 & 7 selon les figures 4a, 4b et 4c. 

19. Composition comprenant une proline morphog6n€tique d'origine osseuse de mammifferes ou une 
portion de celle-ci, pratiquement exempts de tous facteurs osteoinductifs associes, dans laquelle la 
proline comprend une sequence d'acides amines choisie dans le groupe consistant en une premiere 
sequence d'acides amines telle que representee dans la figure 3. comprenant des nSsidus d'acides 
amines numerates 1 & 180, et une seconde sequence d'acides amines telle que representee dans la 
figure 5, comprenant des rSsidus d'acides amines numerates 1 k 182, et ou la portion possede une 
activite biologique de la proline. 
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PEPTIDE A 

VAL ASH ALA LEU ASP CLU ASP SER LEU THR MET ASP LEU CLU PHE ARC 
PROBE A 

3* CTC AAT CCC CTC CAT CAC CAC TCC CTC ACC ATC CAC CTC CAC TTC CC 3* 
T f T T 

3' CTC AAT CCC CTC CAT CAC CAC ACC CTC ACC ATC CAC CTC CAC TTC CC 3' 

T f f f 

PEPTIDE B 

CLU SER CLU ALA ASP PRO AU THR CYS ASP PHE CLN ARC 
PROBE B 

3* CAC TCC CAC CCT CAC CCT CCC ACC TCC CAC TTC CAA CC 3' 

T ¥ f c 

3* CAC ACC CAC CCT CAC CCT CCC ACC TCC CAC TTC CAA CC 3' 

T T T C 

PEPTIDE C 

MET SER ALA CLU CLN VAL CLN ASM VAL TRP VAL ARC 
PROBE C 

3 f ATC TCT CCT CAA CAA CTC CAA AAC CTC TCC CTC CC 3 f 
C C C T 

3' ATC ACT CCT CAA CAA CTC CAA AAC CTC TCC CTC CC 3 1 
C C C f 

PEPTIDE D 

CLY TYR HIS VAL FRO VAL ALA VAL CYS ARC 
PROBE D 

3* CCC TAC CAC CTC CCT CTC CCT CTC TCC AC 3' 
T f T C 

ADDITIONAL TRYPTIC PEPTIDES 

PEPTIDE C 

CLY CLU PRO LEU TYR CLU PRO SER ACR 
PEPTIDE r 

CLU ALA LEU SER ALA SER VAL ALA LYS 
PEPTIDE C 

VAL ASH SER CLU SEE LEU SEE PRO TYR 

PEPTIDE B 

ASN SER TYR LEU LEU CLY LEU THR PRO ASP ARC 



B 3 I 
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Two bovmc eyons 



AxgProI^uProl^uValPhel^uLy^alAsriM 
ThrProLeuAlaProCysPheSerGluG LyGlnArgProGlyArgGlyGInLeuAspHls 



AsnAlaProCysProLeuPhePheOP Ai jSerThrProTrpThrArgThrAlaOP Pro 

• AACCCCCCTTGCCCCTTCTTTTTCTCAAG zrr^rarrr rrMrnxnnxcxGCTTGXcr^ 

• TTGCGGGGMCGGGGAACAAAAAGACTTCFACTTCCGGGACCTGCTCCTG 



GlyLeuArgValGlsAspSerArgAspAspVaiGlnGluGlylleOP ClyArgProArg 
TrpThrAM SerSezGlyPheLysArgAr^ArgAlaGlyGlyAAnLeuArgGlnThrPro 

TYift AfTT Afi AftTTT X(tCX ATTT AAA AC AgfiAgGTG£AfVS ACfVi AATPTYSACCf^Afi AfygftS 
ACCTGAATTTTAAftTrf^AAftTTCTCTCCTG^ 
— D I 

HisLeuOP l^uPzt3GluGXyl^uPro^9C2lyGl\iCysGlyAlaXysAlaThr5erLeu 
ProProVaIThrS«rAxgGlyAlaThrthrT*OP ValTrpGlyLysGlyHlsClnPro - 
CCACCTGTGACTTCCAGAGGGGCTACCACGT&TGAGTG1GGGGGAAAGGCCACGAGCCT * * < 
GGTGGACACTGAAGGTCTCCCCGATGGTGCAdCACTCACACCCCGTTTCCGGTGGTCGGA * ♦ « 



Excn 

3 



;iJpi 



Pt>Ka<, p 

SerSerPTOPhcLcuI^uPheGlnProValAlaValCv^ 
GluLeuThrlleSexProPhftPrcJUaArgSIy^ 
ClyAlaHisHisPhcSerPheSerSrfrProTrpProPhcAiaGluAlaProCysGlyCys 
» GGAGCTCACCATTTCTCCTTTTCCA<fcCOT 

• CCTCGAG TGGTAAAGAGGAAAAGGTCGGGCACCGGCAAACGTCTTCGTWCACGC 
foerf.de. C | 

AlaGluGlnValGJjiA^ValTrpValAxq^ 
CysOP ThrOlyAlaGluAxgvaliiyserLeuProLwVall^GlnLeuTrpValGln 
LeuI^uAsaAxgCyjArgThrCysGlyPheAlaAlaThzGlyProProAJal^GlyPro 
CTGCTGAAO^TGCAGAJUttTGItraTTC 

GACGACTTGTCCACGTCTTGCAGACCCAAGCGACGGTGACCAGGAW 
SerSerGluGli ValCyaThxGlyAlaCyrthrGlyValHisAr 



GlnGlaOP 



gTjrrAlaProHlaPro 
sArgCysAlaGlnValcysThrPzoter 
GlaValc "* "* 



E*Cr\ 



ArgC lyMatHisGlyGlyLauHi* 
AlaAlaValLysAx 9TyrAlaArgGlyLft\iAlaGInValCyaThzGlyHdtHisProli« 
GCAGCAGTGAAGAG STATGCACGGGGGCXTCCACAGGTGTGCACAGGXATGCACCCCATC * » * 
CGTCGTCACTTCTC CAXACGTGCCCCCGAACGTGTCCACACGTGTCCATACGTGGGGIAG • • • 
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1 TGATAAACAGCTGCTTTCAGGACAACTGGTCAGCCCCAAAGGCACACAGACAATCTCCCT 
ACTATTTGTCGACGAAAGTCCTGTTCACCAGTCGGGGTTTCGGTGTGTCTGTTAGAGGGA 

-I* 

MetGluLysMetAlaMetLysMetLeuVal 
61 ATCTCTGGCACGGAAATTGTTCTTCCCATAATGGAGAAGATGGCGATGAAGATGTTGGTG 
TAGAGACCGTGCCTTTAACAAGAAGGGTATTACCTCCTCWkC(^CTACTTCTACAACCAC 

IlePheVall^uGlyMetAsnHisTrpThrCysThrGlyPheProValTyrAspTyrAsp 
121 ATATTTGTCCTTGGAATGAACCACTGGACTTGTACAGGTTTCCCGGTGTATGACTATGAC 
TATAAACAGGAACCTTACTTGGTGACCTGAACATGTCCAAAGGGCCACATACTGATACTG 

ProAlaSerLeuLysGluAlaLeuSerAlaSerValAlaLysValAsnSerGlnSerLeu 
181 CCGGCTTCCCTGAAGGAGGCTCTCAGCGCCTCTGTGGCAAAAGTGAATTCCCAGTCACTG 
GGCCGAAGGGACTTCCTCCGAGAGTCGOGGAGACACCGTTTTCACTTAAGGGTCAGTGAC 

SerProTyrLeuPheArgAlaPheArgSerSerValLysArgValAsnAlaLeuAspGlu 
241 AGCCCSTATCTGTTTCGGGCGTTTAGAAGCTCAGTTAAAAGAGTCAACGCCCTGGACGAG 
TCGGGGATAGACAAAGCCCGCAAATCTTCGAGTCAATTTTCTCAGTTGCGGGACCTGCTC 

AspSerl^uThrMetAspLeuGlvxPheArglleGlnGluThrThrCysArgArgGluSer 
301 GACAGCTTGACCATGGACTTAGAGTTCAGGATTCAAGAGACGACGTGCAGGAGGGAATCT 
CIGTCGAACTGGTACCTGAATCTCAAGTCCTAAGTTCTCTGCTGCACGTCCTCCCTTAGA 

GluAlaAspPraAlaThiCysAspPheGlnAx^lyTyrHisValProValAlaValCys 
361 GAGGCAGACCCOGCCACCTGTGACTTCCAGAGGGGCTACCACGTGCCCGTGGCCGTTTGC 
CTCCGTCTGGGGCGGTGGACACTGAAGGTCTCCCCGATGGTGCACGGGCACCGGCAAACG 

ArgSerThrValArgMetSerAlaGluGlnValGlnAahValTrpValArgCysHisTxp 
421 AGAAGCACCGTGCGGATGTCTGCTGAACAGGTGCAGAACGTGTGGGTTCGCTGCCACTGG 
TCTTCGTGGCACGCCTACAGACGACTTGTCCACGTCTTGCACACCCAAGCGACGGTGACC 

SerSerSerSerGlySerSerSerSeiGluGluMet PhePheG lyAspI leLeu GlySer 
481 TCCTCCAGCTCTGGGTCCAGCAGCAGTGAAGAGATGTTTTTTGGGGATATCTTGGGATCC 
AGGAGGTCGAGACCCAGGTCGTCGTCACTTCICT A CAAAAAACCCCTATAGAACCCTAGG 

SerThrSerATgAsnSerTyrl^ul^uGlyl^ullirPrc^pArgSerArgGlyGluPro 
541 TCTACATCAAGAAACAGTTACCTGCTTGGCCTCACTCCTGACAGATCCAGAGGTGAACCA 
AGATGTAGTTCTTTGTCAA1GGACGAACCGGAGT6AGGACTGTCIAGGTCTCCACWGGT 

l*uTyxGluProS«wArgGluM«tArgArgA*nPl^ 
601 CTTTATGAACCATCACG16AGATGAGAAGAAACTTTCCTCTTGGAAATAGAAGGTACTCG 
GAAATACTTGGTAGTGCACTCTACTCTTCTTTGAAAGGAGAACCTTTATCTTCCATGAGC 

\f 

AsnProTrpProArgAlaArgValAanProGlyPheGluOP 
661 AACCCGTGGCCCAGAGCAAGAGTAAACCCTGGCTTTGWIGACAGCCT^ 

TTGGCCACCGG G TCTOGTTCT C ATTTGGGACCCAAACTCAC1GTCGGAATTCGTTTIA0G 

721 ACTGGAAGGAATAGAAGTTCCAATGAAGAAA^ 

tgaccttccttatcttcaaggttacttctttctatggaatacttaacacattaaaagaaa 
781 tgatcaattgoustccctaataaatggcttacttttcc 

ACTAGTTAACGTCAGGGATTATrrACCGAATGAAAAGG 



25 



EP 0 409 472 B1 



LeuArgLysAsnCysKisProGlnThrHisArgGluThrLeuSerValSerArgZJeuGln 
SerAM GluGluLeuSerSerProAsnThrAM ArgAspThrLeuCysLeuSarlleThr 
I^ul^uGlyAxgThrVallleProLysHisIleGluArgHisSerLeuSerLcuAspTyr 
CTCTTAGGAAGAACTGTCATCCCCAAACACATAGAGAGACACTCTCTGTCTCTCGATTAC 
GAGAATCCTTCTTGACAGTAGGGGTTTGTGTATCTCTCTGTGAGAGACAGAGAGCTAATG 

SerOP PhftProGluTrpAroAroOP AroOP OP OP AroTyrOP LeuCysLeuLeu 
Il^ft^ilftSerAraMgtciuLv^tThry fttMetMetLvsllel^uIleMetPheAIa 
AsnHisAspPheGlnAsnGlyGluAspAspAspAspAspCluAspIleAspTyrValCy* 
AATOTGATTTCCAGAATGGAGAAGATGACGAIGATGATGAAGATATTGATTATGT^ 
TTAGTACTAAAGGTCTTACCTCTTCTACTGCTACTACTACTTCTATAACTAATACAAACG 



__ T^uftl iiOP ThrThrOlvLguAl aff^alAroTprSarProThrTrpProProAlaLeu 
LeuGlvMetAsnTvrTrpSerCysSerGa yLysValPheThrAsnl^uAlaThrCysSer 
SerTrpAsnGl uLeuLeuVaiLauLeuArwOC GlylleHisGlnProGlyHlsLeuLeu 
TCTTGGAATGAACTACTGGTCTTGCTC^ 
AGAACCimCTTGATCACCAGAACGAGTra 

A5pHisAlaGluProCysTrpArgL6uCysLeuValSerteuCy5ProMetCysLeuArg 
GlySefCysArgAlaMetI*uAlaProValSerCy$I*uIh^^ 
TrpIleMetGlnSerHisAlaGlyAlaCysvalLeuSerHisCysAlaProCysAlaCys 
TGGATCATGCAGAGCCATGCTGGCGCCTGTGTCTTGTCTCACTGTGCCCCATC 
ACCTAGTACGTCTCGGTACGACCGCGGACACAGAACAGAGTGACACGGGGTACACGAACG 

ValGl JyalSerGlnCvsThrThrThrlleHisProProOC GlvMetProSerValPro 
CysProGp.yPheProValTvrA3DTyrAspProSer£erI^uArq^pAlaLcuSerAla 
ValSerArgPheProScrValArgLcuArgSerllel^xiLeuLysGlyCysProGlnCy* 
GTGTCCAGGTTTCCCAGlGTACGACTACGATCCATCCTCCTTAAGGGATGCCCTCAGTW 
CACAGGTCp^GGGTCACArc 

LeuTrpQC LysOP IleProSerHisOP ValArqlleCysPheGlyHlsSerGluAla 
SerValVal LysValAsnSerG lnSerl^ 
LeuCyaGlyLysSerGluPheProValthH^^ 

CTCTGTGGTAAAAGTGAATTCCCAGTCACTGAGTCCGTATCTGTTTCGGGCATTCAGAAG 
GAGACACCATTTTCACTTAAGGGTCA^ 

HisOC LysGli iOC ValGlnA*nGluIlePheS«rThrProProSerA«nAlaValCy« 
SerLeuLysArg ZalSerAlaLysOP AsnLsuLeuTyrSerSerPheGlnCysCysLeu 
LeuIleLysLyss ^LysCy»LyiWatLy*SerSarLauLauLeuLauPrcMetLauS«r 
CTCATTAAAAAGA JTAAGTGCAAAATGAAATCTTCTCTACTCCTCCTTCCAATGCTGTCT 
GAGTAATTTTTCI IATTCACGTTTTACTTTAGAAGAGATGAGGAG6AAGGTTACGACAGA 





4cu 
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SerllelleAsnPheArgAsnAIaLeuIleThrLeuOP AsnPhellePheValLyatfal 
AM HisHisLysLeuGlnLysCysZleAspHisThrLeuLysLeuTyrPheCysGluG ly 
IleAlaSerOC ThrSexGluMetHi*OP SerHisSexGluThrLeuPheLeuOP Ai j 
AIAGCATCATAAACTTCAGAAATGCATTGATCACACTCTGAAACTTTATTTTTGTGAAG 3 
TATCGTACTATTTGAAGTCTTTACCTAACTACTCTGAGACTTTGAAA7AAAAACACTTC : 



?Jl5IyPr5Sxg^ E*on 
LeuArgSerAM MetArgThrThrTtpSarOP IlcAM SarSarAlaSarGlyArgteu . 
TTGAGGTCCTAGATGAGMCAACTTGGTCATGAATTTAGAGTTCAGCATC^ J) 
AACTCCAGGATCTACTCTTGTTGAACCACTACTTAAAT^ 

(^9ArqLvgA«t»fi*T<SlvCluAgi>ProAlaTh^ 
MetGlnGluGlyPheTrpArgArgSerArgTyr^ 
HisAlaGlyArgIleI*uGluLy»IleProI«uHisValPrc^ 
CATGCAGGAAGGATTCTGGAGAAGATCCOGCTACATGTGCCTTCXAGAGGGACTA 
GTACGTCCTTCCTAAGACCTCTTCTAGGGP^TCTACACGGAAGGTCTCCCTGATGA 

talSeiGlyArgArgProIlePx^luMetAsnLys^ 
GJLyLysTrpGluGluThrHisProArgA«DGluGlnLysGluGluProHisPh€PheHis 
TrpOC ValGlyGlyAspProSarGlnLysOP ThrLysGlyArgAlaSerLeuLeuPro 
TGGTJ^GTGGGAGGAGACCCATCCCAGAAATGAACA 

ACjpATTCACCCTCCTCTGGGTAGGGTCTTTACriXj TTT TCC TTCTCGGAGTGAAGAAGGT 



XI ac lufil uArgThrH t <PrePhaValPh^MS ^Pr^ln^uPheAl aGluAlaPro 

_ BPrSvSt 

CAGC^AAGAGAGAACTCACCCCTTTGTCTTTTC 



SerOP Ar^CluAanS^PreLauCvsLctiPheGl nS^JurAlaValCy 

- - -iiHisSercysi^euGixiLysHia 



GlnLeuLysArgGluLeuThrPrdLetiSerPhePrbV 

CAGCTGAAGAGAGAACTCACCCCTTTGTCTTTTCCAGj- _ 

^TCGACTTCTCXCTTGAGTGGGGAAACAGAAAAGGTAGGTGTOGACAAA fc 

ArgGluGlyliaCyaProAlaGly^ 

CGTGAAGGTATCTGCCCAtXAGCTGCAGGCGGTGCATGCTOGCTGCAGCTG^ 
GCACTTCCATAGACGGGTCGTCCACGTCCGCCAOGTACGAGCGACGTCGA 

ThrSeiGluSftrrvrS^rSarGluGluy alOP LeuGlyPro 
HisvSlOP ^li^^loflliJ^Arg^yMetThrGlyA lai^ 
CACGTCTGAGTCTTACAGCAGCGAAGAGpTATGACTGG GGCC TTG 
GTGCAGACTCAGAATGTCGTCGCTTCTCfcATACTGACCCCGGAAC 



Figure 
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PhePheLeuSerCysAlaAspThrSerOP CysLeuAsnPhePheAr iQP PhaLeuGly 
PheLeuSerPheMetCyaOP HialleLeuMetProGluPheLeuAM tetllePhcGly 
PheSerPhePheHlsValLouThrHisPraAspAlaOP UeSerLeu* spAspPheTxp 
TTTTCTTTCTTTGA2GTGCTGACACATCCTGATGCCTGAATTTCTTTAG \TGATTTTTGG 
AAAAGAAAGAAAGTACAOGACTGTGTAGGACXACGGACTTAAAGAAATC rACTAAAAACC 



ThrTyflTrpArot^iTl^Xjinfil »C1 uThyTI »T1 ^PyyT^n falSerOC AapPXOGly 



,VT^^Lnig 1 ^^ HigLva ^f^^^P , ^^^gC LyLyaLeuArgProArg 
SlyHlsValGlyXleSexOC MetGluLyaGlnLeuSerlleTr ?OC ValLysThrPro 

^-'AACTTAAGAOCCOG 



GlyHlsValGlyXleSexOC MettluLysGlnLeuSerZleS ?OC 

ggacatgttgggatctcataaatggagaaagaattatctatttgpt; 
cctgtacaaccctagagtatttagctcttt g ' 



TTAATAGATAAAC 2ATTCAATTCTGGGGC 



5* 



GluGluSexCysLeuMotllelleThrValLcuGln ralSftrPfteGlnThrS 



TTyrCysValThrG LyI<omieS«A9Pg; 



ArgGlyValLeuSerTyrAspTy 

GluArgSerLeuValLeuOP I^iiI^uLeuCyaTyrArfrSarHlflPheArgArgValHla 
GAGAGGAGTCTT G TCTTATGATTATTACTGTGTTACAG HCTOiTTTCAGACGAGTCCAT 
CTCTCCTCAGAACAGAATACTAATAATGACACAATGTC ZAGAGTAAAGTCTGCTCAGGTA 

vul AflnAmPheM^TlrflvHlal^al^AflDPhrfhrfi^aValProPrQAflp 



LysOP ThrlleLeuOP 



SerLeuG LyLyaOP 



alThrTr 



SerPropC 



AAGTGAACAATTTTATGAICGGTCACTTG 5TAAGTGA' 
TTCACTTGTTAAAATACTAGCCAGTGAAC lATTCACTAAAGAAAGGACGACACGGTGGTC 



PheLeuSarCysCysAlaThrArg , 
>OC VallleSerPheLeuLeuCysHisGln 

kTTTCTTTCCTGCTGTGCGACCAG 



SerOP CyaAapOP ClnScrLeuCyaPhcProPhcLeuSerLeuG LvIleM 
IleLeuMetOP LeuThxGluProMetLauProPheSexIlePheA: 
Aa pI^iAA spValThrAapArgAlaTyr AlaSarLeuPh eTyrLeiiAM 
GATCTTGATGTQACTGACAGAGCCTASGCTTCCCTTTTCXAr 
CTAGAACTACACTGACTCTCTC G GAgACGAAGGGM 



uSerLeuG LvIleMetArqAra 
IlePheAr jAspkisGluLya 
TLeuAM SlySerCP Glu 
lTCTTTAG 3GATCATGAGAA 



(HyXXeAlaSerTr^yiG^^ 

"lyTyrCyaLttuLeuGluThzGluGlyThrGloThrThrGlyT^ lie 

AmPhgfiiuoe ArgPxoOP GlyLyaLyaKetGlnValBiaThrSexV^aPheAxgAn 
OP LeuOP ValThrAlaLeuArgOC GluAsnAlaGlyAlaHiaLyaCyalleAM Lya 
I^uThrLeuSerAaoGlyLeuGluValArgLyaCyaAxt^ 
CTGACTTTGAGZAAOGGCCTTGAGGTAAGAAAATGCA^ 
GACTGAAACTCATTGOOGGAACTCCATTCTTT TA OGTCCAOGT G T G TT CA CAT^ 



7 
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Hum* * BMP cWA 



-a* 

MetIleSerArgMetGluLysMetThrMetMetMetLysIleI«uIleMetPhe 
1 ACAATCATGATTTCCAGAATGGAGAAGAT6ACGATGATGATGAAGATATTGATTATGTTT 
TGTTAGTACTAAAGGTCTTACCTCTTCTACTGCTACTACTACTTCTATAACTAATACAAA 

Al aLeuGl yMetAsnTyrTrpSerCy sSerGlyPheProValTyrAspTyr AspProSer 
61 GCTCTTGGAATGAACTACTGGTCTTGCTCAGGTTTCCCAGTGTACGACTACGATCCATCC 
CGAGAACCTTACTTC^TGACCAGAACGAGTCCAAAGGGTCACATGCTGATGCTAGGTAGG 

SerLeuArg AspAl aLeuScrAl aSerValValLysVa LAsnSerG InSerLeuSerPro 
121 TCCTTAAGGGATCCCCTCAGT6CCTCTGTG6TAAAA6T6AATTCCCAj6TCACTGA6ICC6 
AGGAATTCCCTACGG6AGTCACGGACACACCATTTTCACTTAAGGCTCAGTGACTCAGGC 

TyrLeuPheArgAlaPheArgSerSerLeuLysArgValGluValLcuAspGluAsnAsn 
181 TATCTCTTTCGGGCATTCAGAAGCTCATTAAAAAGAGTT6AGGTCCTAGAT6AGAACAAC 
ATAGACAAAGCCCGTAAGTCTTCGAGTA A ITTITCTCAACTCCAGGATCTACTCTTGTTG 

ZAuValMetAsnl^uGluPheSerlleArgGluThrThxCysArgLyaAspSerGlyGlu 
241 TTGGTCATGAATTTAGAGTTCAGCATCCGGGAGACAACATGCAGGAAGGATTCTGGAGAA 
AACCAGTACTTAAATCTCAAGTCGTAGGCCCTCT G TT G IACGTCCTTCCTAAGACCTCTT 

AspPraAlaThrCysAlaPhttGlnArgAspTyxTyzValSerThrAlaValCysArgSer 
301 GATCCCGCTACATGTGCCTTCCAGAGGGACTACTATGTGTCCACAGCTGTTTGCAGAAGC 
CTAGGGCGATGTACACGGAAGGTCTCCCTGATGATACACAGGTGTCGACAAACGTCTTCG 

ThrValLysValSerAlaGlnGlnValGlsGlyValHisAlaArfCysSerTrpSerSer 
361 ACCGTGAAGGTATCTGCCCAGCAGGTGCAGGGOGTGCATGCTCGCTGCAGCTGGTCCTCC 
TGGCACTTCCATAGACGGGTCGTCCACGTCCCGCACGTAOGAGCGACGTCGACCAGGAGG 

SerThrSezGluSerTyrSerSerGluGluHetllePheGlyAspMetLeuGlySerHis 
421 TCCACGTCTGAGTCTTACAGCAGCGAAGAGATGATTTTTGGGGACATGTTGGGATCTCAT 
AGGTGCAGACTCAGAATGTCGTCGCTTCTCTACTAAAAACCCCTGTACAACCCTAGAGTA 

LysTrpArgAsnAfinTyxLeuPheGlyLeuIleSerAapGluSerlleSerGluGlnPhe 
481 AAATGGAGAAACAATTATCTATTTGGTCTCATTTCAGACGAGTCCATAAGTGAACAATTT 
TTTACCTCTTTGTTAATAGATAAACCAGAGTAAAGTCTGCTCAGGTATTCACTTGTTAAA 

TyrAspArgSerI«uGlyUeMetArgArgVair*uProProGlyA»nAxgArgTyrPro 
541 TATGATOGGTCACTTGGGATCATGAGAAGGGIATTGCC T CCTG GA AACAGAAGGTACCCA 
ATACTAGCCAGTGAACCCTAGTACTCTTCCCAIAACGGAGGACCTTTGTCTTCCATGGGT 

AsnHlsArgHlaArqAlaArglleAanThrAs pPheG luOC 

601 AACCACCGGCACAGAGCAAGAATAAATACTGACTTIGAGTAAOGGCCTTGAGGT 
TTGGTGGCCGTGTCTCGITCTTATTTA1GACTGAAACTCATTGCCGGAACTCCA 
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